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(57) Abstract: This invention relates to a method to promote the differentiation ol stem cells, typically embryonic stem cells, through 
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Method for Modulating Stem Cell Differentiation Using Stem Loop RNA 

The invention Telates to a method to modulate stem cell differentiation comprising 
introducing stem loop containing RNA into a stem cell to ablate mRNA's which 
5 encode polypeptides which are involved in stem cell differentiation; stem loop 
RNA's ; and nucleic acid molecules and vectors encoding stem loop RNA's. 

A number of techniques have been developed in recent years which purport to 
specifically ablate genes and/or gene products. For example, the use of anti-sense 

10 nucleic acid molecules to bind to and thereby block or inactivate target mRNA 
molecules is an effective means to inhibit the production of gene products. This is 
typically very effective in plants where anti-sense technology produces a number of 
striking phenotypic characteristics. However, antisense is variable leading to the 
need to screen many, sometimes hundreds of, transgenic organisms carrying one or 

15 more copies of an antisense transgene to ensure that the phenotype is indeed truly 
linked to the antisense transgene expression. Antisense techniques, not necessarily 
involving the production of stable transfectants, have been applied to cells in culture, 
with variable results. 

20 In addition, the ability to be able to disrupt genes via homologous recombination has 
provided biologists with a crucial tool in defining developmental pathways in higher 
organisms. The use of mouse gene "knock out" strains has allowed the dissection of 
gene function and the probable function of human homologues to the deleted mouse 
genes, (Jordan and Zant, 1998). 

25 

A much more recent technique to specifically ablate gene function is through the 
introduction of double stranded RNA, also referred to as inhibitory RNA (RNAi), 
into a cell which results in the destruction of mRNA complementary to the sequence 
included in the RNAi molecule. The RNAi molecule comprises two complementary 
30 strands of RNA (a sense strand and an antisense strand) annealed to each other to 
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form a double stranded RNA molecule. The RNAi molecule is typically derived 
from exonic or coding sequence of the gene which is to be ablated. 

Surprisingly, only a few molecules of RNAi are required to block gene expression 
5 which implies the mechanism is catalytic. The site of action appears to be nuclear as 
little if any RNAi is detectable in the cytoplasm of cells indicating that RNAi exerts 
its effect during mRNA synthesis or processing. 

The exact mechanism of RNAi action is unknown although there are theories to 
10 explain this phenomenon. For example, all organisms have evolved protective 
mechanisms to limit the effects of exogenous gene expression. For example, a virus 
often causes deleterious effects on the organism it infects. Viral gene expression 
and/or replicationtherefore needs to be repressed. In addition, the rapid development 
of genetic transformation and the provision of transgenic plants and animals has led 
15 to the realisation that transgenes are also recognised as foreign nucleic acid and 
subjected to phenomena variously called quelling (Singer and Selker, 1995), gene 
silencing (Matzke and Matzke, 1998) , and co-suppression (Stain et. al., 2000). 

Initial studies using RNAi used the nematode Caenorhabditis elegans. RNAi 
20 injected into the worm resulted in the disappearance of polypeptides corresponding to 
the gene sequences comprising the RNAi molecule(Montgomery et. al., 1998; Fire et. 
al., 1998). More recently the phenomenon of RNAi inhibition has been shown in a 
number of eukaryotes including, by example and not by way of limitation, plants, 
trypanosomes (Shi et. al., 2000) Drosophila spp. (Kennerdell and Carthew, 2000). 
25 Recent experiments have shown that RNAi may also function in higher eukaryotes. 
For example, it has been shown that RNAi can ablate c-mos in a mouse ooctye and 
also E-cadherin in a mouse preimplanation embryo (Wianny and Zernicka-Goetz, 
2000). 

30 The use of RNAi to ablate stem cell RNA is disclosed in our co-pending application, 
WO 02/16620, which is incorporated by reference. 
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During mammalian development those cells that form part of the embryo up until the 
formation of the blastocyst are said to be totipotent (e.g. each cell has the 
developmental potential to form a complete embryo and all the cells required to 
5 support the growth and development of said embryo). During the formation of the 
blastocyst, the cells that comprise the inner cell mass are said to be pluripotential 
(e.g. each cell has the developmental potential to form a variety of tissues). 

Embryonic stem cells (ES cells, those with pluripotentiality) may be principally 
10 derived from two embryonic sources. Cells isolated from the inner cell mass are 
termed embryonic stem (ES) cells. In the laboratory mouse, similar cells can be 
derived from the culture of primordial germ cells isolated from the mesenteries or 
genital ridges of days 8.5-12.5 post coitum embryos. These would ultimately 
differentiate into germ cells and are referred to as embryonic germ cells (EG cells). 
15 Each of these types of pluripotential cell has a similar developmental potential with 
respect to differentiation into alternate cell types, but possible differences in 
behaviour (eg with respect to imprinting) have led to these cells to be distinguished 
from one another . 

20 Typically ES/EG cell cultures have well defined characteristics. These include, but 
are not limited to; 

i) maintenance in culture for at least 20 passages when maintained on fibroblast 
feeder layers; 

25 ii) produce clusters of cells in culture referred to as embryoid bodies; 

iii) ability to differentiate into multiple cell types in monolayer culture; 

iv) can form embryo chimeras when mixed with an embryo host; 

v) express ES/EG cell specific markers. 

30 Until very recently, in vitro culture of human ES/EG cells was not possible. The first 
indication that conditions may be determined which could allow the establishment of 
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human ES/EG cells in culture is described in W096/22362. The application 
describes cell lines and growth conditions which allow the continuous proliferation 
of primate ES cells which exhibit a range of characteristics or markers which are 
associated with stem cells having pluripotent characteristics. 

5 

More recently Thomson et al (1998) have published conditions in which human ES 
cells can be established in culture. The above characteristics shown by primate ES 
cells are also shown by the human ES cell lines. In addition the human cell lines 
show high levels of telomerase activity, a characteristic of cells which have the 

10 ability to divide continuously in culture in an undifferentiated state. Another group 
(Reubinoff et. al., 2000) have also reported the derivation of human ES cells from 
human blastocyts. Shamblott et al, 1998 have also described EG cell derivation. In 
Lake et al J Cell Science 2000, 113:555-66 and Rathjen et al J Cell Science 1999, 
112: 601-12, ectodermal stem cells are disclosed. The above references are each both 

1 5 incorporated by reference in their entirety. 

A feature of ES/EG cells is that, in the presence of fibroblast feeder layers, they 
retain the ability to divide in an undifferentiated state for several generations. If the 
feeder layers are removed then the cells differentiate. The differentiation is often to 
20 neurones or muscle cells but the exact mechanism by which this occurs and its 
control remain unsolved. 

In addition to ES/EG cells a number of adult tissues contain cells with stem cell 
characteristics. Typically these cells, although retaining the ability to differentiate 

25 into different cell types, do not have the pluripotential characteristics of ES/EG cells. 
For example haemopoietic stem cells have the potential to form all the cells of the 
haemopoietic system (red blood cells, macrophages, basophils, eosinophils etc). All 
of nerve tissue, skin and muscle retain pools of cells with stem cell potential. 
Therefore, in addition to the use of embryonic stem cells in developmental biology, 

30 there are also adult stem cells which may also have utility with respect to determining 
the factors which govern cell differentiation. . Further recent studies have suggested 
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that some stem cells previously thought to be committed to a single fate, (e.g 
neurons) may indeed possess considerable pluripotentcy in certain situations. Neural 
stem cells have recently been shown to chimerise a mouse embryo and form a wide 
range of non-neural tissue (Clark et. al., 2000). 

5 

A further group of cells which have relevance to developmental biology are 
pluripotent embryonal carcinoma cells (EC cells) which are stem cells of 
teratocarcinomas, also referred to as teratomas, which are able to differentiate into all 
cell types found in these tumours. A teratocarcinoma also includes teratocarcinoma 
10 cells which do not have the full pluripotential characteristics of an EC cell but 
nevertheless can differentiate into a restricted number of differentiated tissues. These 
cells have many features in common with ES/EG cells. The most important of these 
features is the characteristic of pluripotentiality. 

15 Teratomas contain a wide range of differentiated tissues, and have been known in 
humans for many hundreds of years. They typically occur as gonadal tumours of 
both men and women. The gonadal forms of these tumours are generally believed to 
originate from germ cells, and the extra gonadal forms, which typically have the 
same range of tissues, are thought to arise from germ cells that have migrated 

20 incorrectly during embryogenesis. Teratomas are therefore generally classed as germ 
cell tumours which encompasses a number of different types of cancer. These include 
seminoma, embryonal carcinoma, yolk sac carcinoma and choriocarcinoma. 

The similar biology of EC cells with ES/EG cells has been exploited to study the 
25 developmental fates of cells and to identify cell markers commonly expressed in EC 
cells and ES/EG cells. For example, and not by way of limitation, the expression of 
specific cell surface markers SSEA-3 (+), SSEA-4 (+), TRA-1-60 (+), TRA-1-81 (+) 
(Shevinsky et al 1982; Kannagi et al 1983; Andrews et al 1984a; Thomson et al 
1995); alkaline phosphatase (+) (Andrews et. al., 1996); and Oct 4 (Scholer et. al., 
30 1989; Kraft et. al, 1996; Reubinoff et. al, 2000; Yeom et. al, 1996). 
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We have accumulated expression studies which identify a number of genes thought 
to be involved in determining the developmental fate of stem cells, particularly 
embryonic stem cells. By northern blotting we have identified the expression of 
human homologs of two signalling pathways believed to be critical in cell fate 

5 determination. Expression of ligands, receptors and downstream components of the 
Notch and Wingless signalling cascades have been elucidated. Using the model 
system NTERA2/D1 embryonal carcinoma cells we have recorded changes in the 
expression of some of these components as the cells differentiate. Bearing in mind 
the role these cascades play in embryonic development throughout the animal 

10 kingdom, these changes suggest a significant role for both the wingless and Notch 
signalling pathways in differentiation of stem cells. Furthermore the activity of some 
genes are required for differentiation to occur along specific pathways e.g. the 
myogenic gene MyoDl. Other genes have activity which inhibits cellular 
differentiation along particular pathways. We envisage regulation of stem cell 

1 5 differentiation to yield a specific cell type could be achieved by: 

(i) inhibition of certain genes that normally promote differentiation along 
particular pathways; therefore promoting differentiation to alternate cell 
phenotypes; 

20 (ii) inhibition of gene activity that prevents differentiation into particular cell 
types; and 

(iii) a combination of (i) and (ii), see figure 1 



25 In our co-pending application, WO02/16620, we introduce KNAi molecules 
homologous to genes encoding factors involved in stem cell differentiation. The 
differentiation of stem cells during embryogenesis, during tissue renewal in the adult 
and wound repair is under very stringent regulation; aberrations in this regulation 
underlie the formation of birth defects during development and are thought to 

30 underlie cancer formation in adults. 
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Generally, it is envisaged that stem cells are under both positive and negative 
regulation which allows a fine degree of control over the process of cell proliferation 
and cell differentiation: excess proliferation at the expense of cell differentiation can 
lead to the formation of an expanding mass of tissue - a cancer - whereas express 
5 differentiation at the expense of proliferation can lead to the loss of stem cells and 
production of too little differentiated tissue in the long term, and especially the loss 
of regenerative potential Certain genes have already been identified to have a 
negative role in preventing stem cell differentiation. Such genes, like those of the 
Notch family, when mutated to acquire activity can inhibit differentiation; such 
10 mutant genes act as oncogenes. On the contrary, loss of function of such genes on 
their inhibition results in stem cell differentiation. 

We propose to use EC cells has a model cell system to follow the effects of 
perturbations in stem cell differentiation. We further propose an alternative approach 
15 to introduce double stranded RNA molecules into stem cells to ablate mRNA's. 

The invention relates to the provision of stem-loop RNA structures which can either 
be synthesised in vitro followed by transfection into a stem cell, or alternatively, 
synthesised in vivo by the stem cell from vectors which are provided with expression 
20 cassettes which include a DNA molecule which includes the coding sequence for the 
stem-loop RNA. 

The DNA molecule encoding the stem-loop RNA is constructed in two parts, a first 
part which is derived from a gene the regulation of which is desired. The second part 

25 is provided with a DNA sequence which is complementary to the sequence of the 
first part. The cassette is typically under the control of a promoter which transcribes 
the DNA into RNA The complementary nature of the first and second parts of the 
RNA molecule results in base pairing over at least part of the length of the RNA 
molecule to form a double stranded hairpin RNA structure or stem-loop. The first 

30 and second parts can be provided with a linker sequence. 
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According to a first aspect of the invention there is provided a method to modulate 
the differentiation state of a stem cell comprising: 

(i) contacting a stem cell with at least one nucleic acid molecule comprising a 
5 sequence of a gene which mediates at least one step in the differentiation of said cell 
which nucleic acid molecule consists of a first part linked to a second part wherein 
said first and second parts are complementary over at least part of their length and 
further wherein said first and second parts form a double stranded region by 
complementary base pairing over at least part of their length; 
10 (ii) providing conditions conducive to the growth and differentiation of the cell 
treated in (i) above; and optionally 

(iii) maintaining and/or storing the cell in a differentiated state. 

In a preferred method of the invention said first and second parts are linked by at 
1 5 least one nucleotide base. 

The provision of first and second sequences which are complementary to one another 
and which comprise at least part of the coding sequence of a gene involved in stem 
cell differentiation means that when the sequence is transcribed into RNA the 

20 complementarity between first and second sequences allows base pairing between 
first and second sequences to form a double stranded RNA structure, see Figure 1. 
The optional provision of a linking region bewteen first and second parts results in 
the formation of a so called "hair-pin" loop structure. The transcription of the 
nucleic acid provides many copies of the hair-pin loop RNA which effectively 

25 functions as a RNAi molecule. 

In a preferred method of the invention said nucleic acid molecule is a stem loop RNA 
molecule. Alternatively, said nucleic acid molecule is a DNA molecule which 
encodes said stem loop RNA. Ideally said DNA molecule is a vector adapted for 
30 expression of said stem loop RNA. 



8 



WO 03/012082 



PCT/GB02/03409 



The stem cell in (i) above may be a teratocarcinoma cell. 

In a preferred method of the invention said conditions are in vitro cell culture 
conditions. 

5 

In a further preferred method of the invention said stem cell is selected from: 
pluripotent stem cells such as embryonic stem cell; embryonic germ cell and 
embryonal carcinoma cells; and lineage restricted stem cells such as, but not 
restricted to; haemopoietic stem cell; muscle stem cell; nerve stem cell; skin dermal 
1 0 sheath stem cell; liver stem cell; and teratocarcinoma cells. 

It will be apparent that the method can provide stem cells of intermediate 
commitment. For example, embryonic stem cells could be programmed to 
differentiate into haemopoietic stems cells with a restricted commitment. 
15 Alternatively, differentiated cells or stem cells of intermediate commitment could be 
reprogrammed to a more pluripotential state from which other differentiated cell 
lineages can be derived. 

In a further preferred method of the invention said stem cell is an embryonic stem 
20 cell or embryonic germ cell. 

In a yet further preferred method of the invention said stem loop RNA molecule is 
derived from a gene which encodes a cell surface receptor expressed by a stem cell. 

25 In a further preferred method of the invention said cell surface receptor is selected 
from: human Notch l(hNotch 1); hNotch 2; hNotch 3; hNotch 4; TLE-1; TLE-2; 
TLE-3; TLE-4; TCF7; TCF7L1; TCFFL2; TCF3; TCF19; TCF1; mFringe; lFringe; 
rFringe; sel 1; Numb; Numblike; LNX; FZD1; FZD2; FZD3; FZD4; FZD5; FZD6; 
FZD7; FZD8; FZD9; FZD10; FRZB. 

30 
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In an alternative preferred method of the invention said stem loop RNA molecule is 
derived from a gene which encodes a ligand. 

Typically, a ligand is a polypeptide which binds to a cognate receptor to induce or 
5 inhibit an intracellular or intercellular response. Ligands may be soluble or 
membrane bound. 

In a further alternative preferred method of the invention said ligand is selected from: 
DIM; D113; D114; Dlk-1; Jagged 1; Jagged 2; Wnt 1; Wnt 2; Wnt 2b; Wnt 3; Wnt 
10 3a; Wnt5a; Wnt6; Wnt7a; Wnt7b; Wnt8a; Wnt8b; WntlOb; Wntll; Wntl4; Wntl5. 

Alternatively, said gene is selected from: SFRP1; SFRP2; SFRP4; SFRP5; SK; 
DKK3; CER1; WIF-1; DVL1; DVL2; DVL3; DVLlLl;mFringe; lFringe; rFringe; 
selll; Numb; LNX Oct4;NeuroDl; NeuroD2; NeuroD3; Brachyury; MDFL 

15 

In a further prefened method of the invention said stem loop RNA molecule is 
derived from at least one of the sequences identified in Table 4 or Figures 4-54. 

In a yet futher preferred embodiment of the invention said sequence is derived from 
20 Oct 4. Preferably the Oct 4 sequence corresponds to nucleotide sequence about 610 
to about 1032 of the Oct 4 sequence found in GenBank accession number NM_ 
002701. 

Many methods have been developed over the last 30 years to facilitate the 
25 introduction of nucleic acid into cells which are well known in the art and are 
applicable to the stem loop RNA structures disclosed herein or the vectors which 
encode said stem loop structures. 

Methods to introduce nucleic acid into cells typically involve the use of chemical 
30 reagents, cationic lipids or physical methods. Chemical methods which facilitate the 
uptake of DNA by cells include the use of DEAE -Dextran ( Vaheri and Pagano 
Science 175: p434) . DEAE-dextran is a negatively charged cation which associates 
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and introduces the nucleic acid into cells. Calcium phosphate is also a commonly 
used chemical agent which when co-precipitated with nucleic acid introduces the 
nucleic acid into cells (Graham et al Virology (1973) 52: p456). 

5 The use of cationic lipids (eg liposomes ( Feigner (1987) Proc.Natl.Acad.Sci USA, 
84:p7413) has become a common method. The cationic head of the lipid associates 
with the negatively charged nucleic acid backbone to be introduced. The lipid/nucleic 
acid complex associates with the cell membrane and fuses with the cell to introduce 
the associated nucleic acid into the cell. Liposome mediated nucleic acid transfer has 
10 several advantages over existing methods. For example, cells which are recalcitrant 
to traditional chemical methods are more easily transfected using liposome mediated 
transfer. 

More recently still, physical methods to introduce nucleic acid have become effective 
15 means to reproducibly transfect cells. Direct microinjection is one such method 
which can deliver nucleic acid directly to the nucleus of a cell ( Capecchi (1980) 
Cell, 22:p479). This allows the analysis of single cell transfectants. So called 
"biolistic" methods physically shoot nucleic acid into cells and/or organelles using a 
particle gun ( Neumann (1982) EMBO J, 1: p841). Electroporation is arguably the 
20 most popular method to transfect nucleic acid. The method involves the use of a 
high voltage electrical charge to momentarily permeabilise cell membranes making 
them permeable to macromolecular complexes. 

More recently still a method termed immunoporation has become a recognised 
25 techinque for the introduction of nucleic acid into cells, see Bildirici et al Nature 
(2000) 405, p298. The technique involves the use of beads coated with an antibody 
to a specific receptor. The transfection mixture includes nucleic acid, antibody coated 
beads and cells expressing a specific cell surface receptor. The coated beads bind the 
cell surface receptor and when a shear force is applied to the cells the beads are 
30 stripped from the cell surface. During bead removal a transient hole is created 
through which nucleic acid and/or other biological molecules can enter. Transfection 
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efficiency of between 40-50% is achievable depending on the nucleic acid used. In 
addition the specificity of cell delivery of RNAi's can be enhanced by association or 
linkage of the RNAi to specific antibodies, ligands or receptors. 

5 There are also a number of commercially available transfection kits which purport to 
provide high efficiency transfection of cells. A kit which is particularly preferred is 
sold under the tradename ExGen 500 tm by MBI Fermentas, Lithuania. ExGen is a 
polyethylenimine, non-liposomal transfection reagent. 

10 According to a further aspect of the invention there is provided a stem loop RNA 
molecule derived from a coding sequence of at least one gene involved in stem cell 
differentiation comprising a first part linked to a second part wherein said first and 
second parts are complementary over at least part of their length and further wherein, 
said first and second parts form a double stranded region by complementary base 

1 5 pairing over at least part of their length. 

In a preferred embodiment of the invention said first and second parts are linked by at 
least one nucleotide base. In a further preferred embodiment of the invention said 
first and second parts are linked by 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotide bases. In a 
20 yet further preferred embodiment of the invention said linker is at least 10 nucleotide 
bases. 

In a preferred embodiment said coding sequence is an exon. 

25 Alternatively said RNA molecule is derived from intronic sequences or the 5' and/or 
3' non-coding sequences which flank coding/exon sequences of genes which mediate 
stem cell differentiation. 

In a further preferred embodiment of the invention the length of the RNA molecule is 
30 between 10 nucleotide bases (nb) -lOOOnb. More preferably still the length of the 
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RNA molecule is selected from lOnb; 20nb; 30nb; 40nb; 50nb; 60nb; 70nb; 80nb; 
90nb. More preferably still said RNA molecule is 21nb in length. 

In a further preferred embodiment of the invention said RNA molecule is lOOnb; 
5 200nb; 300nb; 400nb; 500nb; 600nb; 700nb; 800nb; 900nb; or lOOOnb. More 
preferably still said RNA molecule is at least lOOOnb. 

In a further preferred embodiment of the invention said RNA molecule comprises 
sequences identified in Table 4 or Figures 4-54. 

10 

In yet a further preferred embodiment of the invention said RNA molecules comprise 
modified nucleotide bases. 

It will be apparent to one skilled in the art that the inclusion of modified bases, as 
15 well as the naturally occuring bases cytosine, uracil, adenosine and guanosine, may 
confer advantageous properties on RNA molecules containing said modified bases. 
For example, modified bases may increase the stability of the RNA molecule thereby 
reducing the amount required to produce a desired effect. The provision of modified 
bases may also provide stem-loop structures which are more or less stable. 

20 

According to a further aspect of the invention there is provided a nucleic acid 
molecule encoding at least part of a gene which mediates at least one step in stem cell 
differentiation comprising a first part linked to a second part which first and second 
parts are complementary over at least part of their length, wherein said nucleic .acid 
25 molecule is operably linked to at least one further nucleic acid molecule capable of 
promoting transcription of said nucleic acid linked thereto and further wherein said 
first and second parts form a double stranded region by complementary base pairing 
over at least part of their length as or when said nucleic acid molecule is transcribed. 

30 In a preferred embodiment of the invention said first and second parts are linked by 
linking nucleotides as hereinbefore described. 
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It will be apparent to one skilled in the art that the synthesis of RNA molecules which 
form RNA stem loops can be achieved by providing vectors which include target 
genes, or fragments of target genes, operably linked to promoter sequences. 
5 Typically, promoter sequences are phage RNA polymerase promoters (eg T7, T3, 
SP6). Advantageously vectors are provided with multiple cloning sites into which 
genes or gene fragments can be subcloned. Typically, vectors are engineered so that 
phage promoters flank multiple cloning sites containing the gene of interest. 

10 Alternatively target genes or fragments of target genes can be fused directly to phage 
promoters by creating chimeric promoter/gene fusions via oligo synthesising 
technology. Constructs thus created can be easily amplified by polymerase chain 
reaction to provide templates for the manufacture of RNA molecules comprising 
stem loop RNA's. 

15 

According to a further aspect of the invention there is provided a vector including an 
expression cassette comprising a first sequence linked to a second sequence wherein 
said first and second sequences are complementary over at least part of their lengths 
and further wherein the expression cassette is transciptionally linked to a promoter 
20 sequence. 

In a preferred embodiment of the invention said first and second parts are linked by 
linking nucleotides as hereinbefore described. 

25 Vectors including expression cassettes encoding stem-loop RNA's are adapted for 
eukaryotic gene expression. Typically said adaptation includes, by example and not 
by way of limitation, the provision of transcription control sequences (promoter 
sequences) which mediate cell/tissue specific expression. These promoter sequences 
may be cell/tissue specific, inducible or constitutive. 

30 
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Promoter elements typically also include so called TATA box and RNA polymerase 
initiation selection sequences which function to select a site of transcription 
initiation. These sequences also bind polypeptides which function, inter alia, to 
facilitate transcription initiation selection by RNA polymerase. 

5 

Adaptations also include the provision of selectable markers 
and autonomous replication sequences which both facilitate the maintenance of said 
vector in either the eukaryotic cell or prokaryotic host. Vectors which are maintained 
autonomously are referred to as episomal vectors. Further adaptations which 
10 facilitate the expression of vector encoded genes include the provision of 
transcription termination sequences. 

These adaptations are well known in the art. There is a significant amount of 
published literature with respect to expression vector construction and recombinant 
15 DNA techniques in general. Please see, Sambrook et al (1989) Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbour Laboratory, Cold Spring Harbour, NY and 
references therein; Marston, F (1987) DNA Cloning Techniques: A Practical 
Approach Vol m IRL Press, Oxford UK; DNA Cloning: F M Ausubel et al, Current 
Protocols in Molecular Biology, John Wiley & Sons, Inc.(1994). 

20 

According to a further aspect of the invention there is provided a cell transfected with 
the nucleic acid or vector according to the invention. Preferably said cell is an 
embryonic stem cell or embryonic germ cell. Alternatively said cell is an embryonal 
carcinoma cell. 

25 

According to a further aspect of the invention there is provided a method to 
manufacture stem loop RNA molecules comprising: 

(i) providing a vector or promoter/gene fusion according to the invention; 

30 
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(ii) providing reagents and conditions which allow the synthesis of the RNA 
molecule comprising a stem loop RNA molecule according to the invention; and 

(iii) providing conditions which allow the RNA molecule to base pair over at least 
5 part of its length, or at least that part corresponding to the nucleic acid sequence 

encoding said stem cell gene which mediates stem cell differentiation. 

Preferably said gene, or gene fragment is selected from those genes represented in 
table 4 or Figures 4-54. 

10 

In vitro transcription of RNA is an established methodology. Kits are commercially 
available which provide vectors, ribonucleoside triphosphates, buffers, Rnase 
inhibitors, RNA polymersases (eg phage T7, T3, SP6) which facilitate the production 
of RNA. 

15 

According to a further aspect of the invention there is provided an in vivo method to 
promote the differentiation of stem cells comprising administering to an animal an 
effective amount of stem loop RNA molecule, or vector encoding a stem loop RNA 
molecule according to the invention, sufficient to effect differentiation of a target 
20 stem cell. 

Preferably said method promotes differentiation in vivo of endogenous stem cells to 
repair tissue damage in situ. 

25 It will be apparent to one skilled in the art that stem loop RNA relies on homology 
between the target gene RNA and double stranded region of the stem loop in a 
similar way to conventional RNAi. This confers a significant degree of specificity to 
the stem loop RNA molecule in targeting stem cells. For example, haemopoietic 
stem cells are found in bone marrow and stem loop RNA molecules may be 

30 administered to an animal by direct injection into bone marrow tissue. 
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Stem loop RNA molecules may be encapsulated in liposomes to provide protection 
from an animals immune system and/or nucleases present in an animals serum. 

Liposomes are lipid based vesicles which encapsulate a selected therapeutic agent 
5 which is then introduced into a patient. Typically, the liposome is manufactured 
either from pure phospholipid or a mixture of phospholipid and phosphoglyceride. 
Typically liposomes can be manufactured with diameters of less than 200nm, this 
enables them to be intravenously injected and able to pass through the pulmonary 
capillary bed. Furthermore the biochemical nature of liposomes confers 

10 permeability across blood vessel membranes to gain access to selected tissues. 
Liposomes do have a relatively short half-life. So called STEALTH* liposomes have 
been developed which comprise liposomes coated in polyethylene glycol (PEG). The 
PEG treated liposomes have a significantly increased half-life when administered 
intravenously to a patient. In addition STEALTH R liposomes show reduced uptake 

15 in the reticuloendothelial system and enhanced accumulation selected tissues. In 
addition, so called immuno-liposomes have been develop which combine lipid based 
vesicles with an antibody or antibodies, to increase the specificity of the delivery of 
the RNAi molecule to a selected cell/tissue. 

20 The use of liposomes as delivery means is described in US5580575 and US 5542935. 

It will be apparent to one skilled in the art that the stem loop RNA molecules can be 
provided in the form of an oral or nasal spray, an aerosol, suspension, emulsion, 
and/or eye drop fluid. Alternatively the stem loop RNA molecules may be provided 
in tablet form. Alternative delivery means include inhalers or nebulisers. 

25 

According to a yet further aspect of the invention there is provided a therapeutic 
composition comprising a stem loop RNA molecule according to the invention or a 
vector encoding a stem loop RNA according to the invention. 

30 Preferably said stem loop RNA molecule or vector is for use in the manufacture of a 
medicament for use in promoting the differentiation of stem cells to provide 
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differentiated cells/tissues to treat diseases where cell/tissues are destroyed by said 
disease. 

Typically this includes pernicious anemia; stroke, neurodegenerative diseases such as 
5 Parkinson's disease, Alzhiemer's disease; coronary heart disease; cirrhosis; 
diabetes. It will also be apparent that differentiated stem cells may be used to replace 
nerves damaged as a consequence of (eg replacement of spinal cord tissue). 

In a further preferred embodiment of the invention said therapeutic composition 
10 further comprises a diluent, carrier or excipient. 

According to a further aspect of the invention there is provided a cell obtainable by 
the method according to the invention. 

15 It will be apparent that a cell obtainable by the method according to the invention has 
useful applications . For example, a stably transfected cell under the control of a 
regulatable promoter (ie inducible, repressible, developmental^ regulated, cell 
lineage regulated, cell-cycle regulated) offers the opportunity to modulate the 
expression of the stem-loop RNA in said cell thereby modulating the differentiation 

20 state, or not as the case maybe, in culture or in viyo. 

According to a yet further aspect of the invention there is provided at least one organ 
comprising at least one cell obtainable by the method according to the invention. 

25 According to a yet further aspect of the invention there is provided a non-human • 
transgenic animal comprising a RNA molecule according to the invention, or a 
nucleic acid molecule according to the invention, or a vector according to the 
invention. 

30 An embodiment of the invention will now be described by example only and with 
reference to the following figures and tables wherein: 
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Table 1 represents a selection of antibodies used to monitor stem cell differentiation; 

Table 2 represents nucleic acid probes used to assess mRNA markers of stem 
5 differentiation; 

Table 3 represents protein markers of stem cell differentiation; 

10 Table 4 represents specific primers used to generate stem loop RNA for gene 
specific inhibition; 

Table 5 represents vectors used for the expression of stem loop RNA in cells 
including the promoters used to drive transcription of stem loop RNA's. 

15 

Figure 1 illustrates stem cell differentiation is controlled by positive and negative 
regulators (A). The specific cell phenotypes that are derived are a direct result of 
positive and negative regulators which activate or suppress particular differentiation 
events. Stem loop RNA can be used to control both the initial differentiation of stem 
20 cells (A) and the ultimate fate of the differentiated cells Dl and D2 by repression of 
positive activators which would normally promote a particular cell fate; 

Figure 2 represents the Oct 4 nucleic acid sequence from position 610-1032 of the 
sequence found in GenBank accession number NM_002701. 

25 

Fig 3A illustrates a transcription cassette comprising a promoter sequence operable 
linked to a nucleic acid encoding a stem loop RNA; Fig 3B illustrates a stem loop 
RNA synthesised from the cassette illustrated in Fig 1 A; 

30 Figure 4 is the nucleic acid sequence of murine notch ligand delta-like 1 ; 

Figure 5 is the nucleic acid sequence of murine notch ligand jagged 1; 
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Figure 6 is the nucleic acid sequence of human notch ligand jagged 1 (alagille 
syndrome) (JAG1); 

Figure 7 is the nucleic acid sequence of human notch ligand jagged 2 (JAG2) 

5 

Figure 8 is the nucleic acid sequence of murine notch ligand jagged 2; 
Figure 9 is the nucleic acid sequence of human notch ligand delta-like 3 (DLL3); 
10 Figure 10 is the nucleic acid sequence of human notch ligand delta-1 (DLL1); 

Figure 11 is the nucleic acid sequence of human notch ligand delta-like 4 (DLL4); 
Figure 12 is the nucleic acid sequence of murine notch ligand delta-like 4(DLL4); 

15 

Figure 13 represents the nucleic acid sequence of human Wnt 13\ 
Figure 14 represents the nucleic acid sequence of human dickkopfl; 
20 Figure 15 represents the nucleic acid sequence of human dickkop/2; 

Figure 16 represents the nucleic acid sequence of human dickkopf3\ and 
Figure 17 represents the nucleic acid sequence of human dickkopf4\ 

25 

Figure 18 represents the nucleic acid sequence of WNT-1; 
Figure 19 represents the nucleic acid sequence of WNT-2; 
30 Figure 20 represents the nucleic acid sequence of WNT 2B; 
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Figure 21 represents the nucleic acid sequence of WNT 3; 
Figure 22 represents the nucleic acid sequence of WNT 4; 
5 Figure 23 represents the nucleic acid sequence of WNT 5 A; 
Figure 24 represents the nucleic acid sequence of WNT 6; 
Figure 25 represents the nucleic acid sequence of WNT 7A; 

10 

Figure 26 represents the nucleic acid sequence of WNT 8B; 
Figure 27 represents the nucleic acid sequence of WNT 10B; 
1 5 Figure 28 represents the nucleic acid sequence of WNT 1 1 ; 
Figure 29 represents the .nucleic acid sequence of WNT 14 
Figure 30 represents the nucleic acid sequence of WNT 16; 

20 

Figure 31 represents the nucleic acid sequence of FZD 1; 
Figure 32 represents the nucleic acid sequence of FZD 2; 
25 Figure 33 represents the nucleic acid sequence of FZE 3; 
Figure 34 represents the nucleic acid sequence of FZD 4; 
Figure 35 represents the nucleic acid sequence of FZD 5; 

30 

Figure 36 represents the nucleic acid sequence of FZD 6; 
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Figure 37 represents the nucleic acid sequence of FZD 7; 
Figure 38 represents the nucleic acid sequence of FZD 8; 

5 

Figure 39 represents the nucleic acid sequence of FZD 9; 
Figure 40 represents the nucleic acid sequence of FZD 10; 
10 Figure 41 represents the nucleic acid sequence of FRP; 
Figure 42 represents the nucleic acid sequence of SARP 1; 
Figure 43 represents the nucleic acid sequence of SARP 2; 

15 

Figure 44 represents the nucleic acid sequence of FRZB; 
Figure 45 represents the nucleic acid sequence of FRPHE; 
20 Figure 46 represents the nucleic acid sequence of SARP 3; 
Figure 47 represents the nucleic acid sequence of CER 1; 
Figure 48 represents the nucleic acid sequence of DKK1; 

25 

Figure 49 represents the nucleic acid sequence of DKK 2; 
Figure 50 represents the nucleic acid sequence of DKK 3; 
30 Figure 51 represents the nucleic acid sequence of DKK 4; 
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Figure 52represents the nucleic acid sequence of WIF-1; 
Figure 53 represents the nucleic acid sequence of SRFP 1; 
5 Figure 54 represents the nucleic acid sequence of SRFP 4; 



10 



15 Materials and Methods 
Cell Culture 

NTERA2 and 2102Ep human EC cell lines were maintained at high cell density as 
previously described (Andrews et al 1982, 1984b), in DMEM (high glucose 
20 formulation) (DMEM)(GIBCO BRL), supplemented with 10% v/v bovine foetal calf 
serum (GIBCO BRL), under a humidified atmosphere with 10% CO2 in air. 

Stem Loop RNA Production 

25 Primers were designed against specific target genes with T7 bacteriophage promoters 
at their 5' ends . The primers consist of typically 18- 25 bp against the target gene, a 
linker sequence of variable length (indicated by N in primer sequence) followed by 
the reverse complement of the gene specific sequence. The primers were used in a 
standard RNA in vitro, transcription reaction using a MEGASCRJPT kit following 

30 manufacturers protocols (Ambion, USA). Longer slRNA templates were produced 
buy cloning head-to -tail the sense and anti-sense gene specific sequences to generate 
a palindromic template from which RNA could be synthesized. 

The following primers were used 

35 
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Gene 


Accession 
Number 


Primer Sequence 


Oct4 


Z11899 


TAA TAC GAC TCA CTA TAG 
Ggagcagcttgggctcgagaag(N)cttctcgagcccaagctgctc 


HsNotch2 




TAA TAC GAC TCA CTA TAGGt cgt gca aga gcc 
agt tac cc(N)gg gta act ggc tct tgcacg a 


HsNotchl 


M73980 


TAA TAC GAC TCA CTA TAGGa atg gtc aat gcg 
agt ggc tgt cc(N)gg aca gcc act cgc gtt gac cat t 


CIF 




TAA TAC GAC TCA CTA TAGGa gta gtg aga gtg 
aga gta aca(N)tgt tac tct cac tct cac tac t 


RBPJ-kappa 




TAA TAC GAC TCA CTA TAGGt cctgtg cctgtg gta 
gag a(N)t etc tac cac agg cac agg a 


Dlkl 


NM_002226 


TAA TAC GAC TCA CTA TAGGcctc ttg etc ctg ctg 
get tt(N)aaagccagcaggagcaagagg 



Capital letters indicate the T7 polymerase promoter sequence. 

5 In each case, a quantity of the PCR was electrophoresed through agarose to verify, 
product size and abundance, whilst the remainder was purified by alkaline 
phenol/chloroform extraction. RNA was synthesized using the Megascript kit 
(Ambion Inc.) according to the manufacturer's protocol and acid phenol/chloroform 
extracted. The simultaneous synthesis of complementary strands of RNA in a single 
10 reaction circumvents the requirement for an annealing step. However, the quality and 
duplexing of the synthesized RNA was confirmed by agarose gel electrophoresis, 
with the desired products migrating as expected for double stranded DNA of the 
same length. 

15 Stem Loop RNA introduction to Cell Lines 

Human EC stem cells were seeded at 2 XI 0 5 cells/well of a 6 well plate in 3 cm 3 of 
Dulbecco's modified Eagles medium and allowed to settle for 3 hrs. 
Appx. 9.5|ig of DNA was incubated with an optimised amount of ExGEN 500 for 
20 each well of a 6-well plate. Previously cells were seeded 1 day before. This gives 
apprx. a 70% confluent culture. The DNA/ExGen mixture was added to the cells and 
the culture vessel spun at 280g for 5 mins. 

Total RJVA production 
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Growing cultures of cells were aspirated to remove the DME and foetal calf serum. 
Trace amounts of foetal calf serum was removed by washing in Phosphate-buffered 
saline. Fresh PBS was added to the cells and the cells were dislodged from the 
5 culture vessel using acid washed glass beads. The resulting cell suspension was 
centrifuged at 300xg. The pellets had the PBS aspirated from them. Tri reagent 
(Sigma, USA) was added at 1ml per 10 7 cells and allowed to stand for 10 mins at 
room temperature. The lysate from this reaction was centrifuged at 12000 x g for 15 
minutes at 4°C. The resulting aqueous phase was transferred to a fresh vessel and 
10 0.5 ml of isopropanol / ml of trizol was added to precipitate the RNA. The RNA was 
pelleted by centrifugation at 12000 x g for 10 mins at 4°C. The supernatant was 
removed and the pellet washed in 70% ethanol. The washed RNA was dissolved in 
DEPC treated double-distilled water. 

15 Analysis of the differentiation of EC stem cells induced bv exposure to Stem Loop 
RNA 

Following exposure to stem loop RNA corresponding to specific key regulatory 
genes, the subsequent differentiation of the EC cells was monitored in a variety of 
20 ways. One approach was to monitor the disapearance of typical markers of the stem 
cell phenotype; the other was to monitor the appearance of markers pertinent to the 
specific lineages induced. The relevant markers included surface antigens, mRNA 
species and specific proteins. 

25 Analysis of Transfectants bv Antibody Staining and FACS 

Cells were treated with trypsin (0.25% v/v) for 5 mins to disaggregate the cells; they 
were washed and re-suspended to 2xl0 5 cells/ml. This cell suspension was incubated 
with 50^1 of primary antibody in a 96 well plate on a rotary shaker for 1 hour at 4°C. 
30 Supernatant from a myeloma cell line P3X63Ag8, was used as a negative control 
The 96 well plate was centrifuged at lOOrpm for 3 minutes. The plate was washed 3 
times with PBS containing 5% foetal calf serum to remove unbound antibody. Cell 
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were then incubated with 50 \il of an appropriate FITC-conjugated secondary 
antibody at 4°C for 1 hour. Cells were washed 3 times in PBS + 5% foetal calf 
serum and analysed using an EPICS elite ESP flow cytometer (Coulter eletronics, 
U.K).(Andrews et. al., 1982) 

5 

Northern blot Analysis of RNA 

RNA separation relies on the generally the same principles as standard DNA but with 
some concessions to the tendancy of RNA to hybridise with itself or other RNA 
molecules. Formaldehyde is used in the gel matrix to react with the amine groups of 

10 the RNA and form Schiff bases. Purified RNA is run out using standard agarose gel 
electrophoresis. For most RNA a 1% agarose gel is sufficiant. The agarose is made in 
IX MOPS buffer and supplemeted with 0.66M formaldehyde.Dryed down RNA 
samples are reconstituted and denatured in RNA loading buffer and loaded into the 
gel. Gels are run out for apprx. 3 hrs (until the dye front is 3/4 of the way down the 

15 gel). 

The major problem with obtaining clean blotting using RNA is the presence of 
formaldehyde. The run out gel was soaked in distilled water for 20 mins with 4 
changes, to remove the formaldehyde from the matrix. The transfer assembly was 

20 assembled in exactly the same fashion as for DNA (Southern) blotting.The transfer 
buffer used however was 10X SSPE. Gels were transfered overnight. The membrane 
was soaked in 2X SSPE to remove any agarose from the transfer assembly and the 
RNA was fixed to the memebrane. Fixation was acheived using short-wave (254 nM) 
UV light. The fixed membrane was baked for 1-2 hrs to drive off any residual 

25 formaldehyde. 

Hybridisation was acheived in aqueous phase with formamide to lower the 
hybridisation temperatures for a given probe. RNA blots were prehybridised for 2-4 
hrs in northern prehybridisation soloution. Labelled DNA probes were denatured at 
30 95 °C for 5 mins and added to the blots. All hybridisation steps were carried out in 
rolling bottles in incubation ovens. Probes were hybridised overnight for at least 16 
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hrs in the prehybridisation soloution. A standard set of wash soloutions were used. 
Stringency of washing was acheived by the use of lower salt containing wash buffers. 
The following wash procedure is outlined as follows 



2X S SPE 1 5 mins room temp 

5 2XSSPE 15 mins room temp 

2X SSPE/ 0. 1% SDS 45 mins 65°C 

2X SSPE/ 0. 1% SDS 45 mins 65°C 

0. IX SSPE 15 mins room temp 



10 Preparation of radiolabeled DNA probes 

The method of Feinberg and Vogelstein (Feinberg and Vogelstein, 1983) was used to 
radioactively label DNA. Briefly, the protocol uses random sequence hexanucleotides 
to prime DNA synthesis at numerous sites on a denatured DNA template using the 
15 Klenow DNA polymerase I fragment Pre-formed kits were used to aid consistency . 
5-100ng DNA fragment (obtained from gel purifcation of PCR or restriction digests) 
was made up in water,denatured for 5 mins at 95°C with the random hexamers. The 
mixture was quench cooled on ice and the following were added, 
5 \xl [a-32P] dATP 3000 Ci/mmol 
1 jil of Klenow DNA polymerase (4U) 

The reaction was then incubated at 37°C for 1 hr. Unincorporated nucleotide were 
removed with spin columns ( Nucleon Biosciences). 

Production of cDNA 

The enzymatic conversion of RNA into single stranded cDNA was achieved using 
the 3' to 5' polymerase activity of recombinant Moloney-Murine Leukemia Virus 
(M-MLV) reverse transcriptase primed with oligo (dT) and (dN) primers. For 
Reverse Transcription-Polymerase Chain Reaction, single stranded cDNA was used. 
cDNA was synthesised from lfag poly (A)+ RNA or total RNA was incubated with 
the following 

1 .0|oM oligo(dT) primer for total RNA or random hexcamers for mRNA 
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0.5mM lOmM dNTP mix 

1 U/|il RN Ase inhibitor (Promega) 

l.OU/p.1 M-MLV reverse transcriptase in manufacturers supplied buffer 
(Promega) 

5 The reaction was incubated for 2-3 hours at 42°C 
Fluorescent Automated Sequencing 

To check the specificity of the PCR primers used to generate the template used in 
stem loop RNA production automatic sequencing was carried out using the prism 

10 fluorescently labelled chain terminator sequencing kit (Perkin-Elmer) (Prober et al 
1987). A suitable amount of template (200ng plasmid, lOOng PCR product), 10 jjM 
sequencing primer (typically a 20mer with 50% G-C content) were added to 8 |il of 
prism pre-mix and the total reaction volume made up to 20 \xl. 24 cycles of PCR 
(94°C for 10 seconds, 50°C for 10 seconds, 60°C for 4 minutes). Following thermal 

15 cycling, products were precipitated by the addition of 2|al of 3M sodium acetate and 
50 |il of 100 % ethanol. DNA was pelleted in an Eppendorf microcentrifuge at 13000 
rpm, washed once in 70% ethanol and vacuum dried. Samples were analysed by the 
in-house sequencing Service (Krebs Institute). Dried down samples were 
resuspended in 4 ^1 of formamide loading buffer, denatured and loaded onto a ABI 

20 373 automatic sequencer. Raw sequence was collected and analysed using the ABI 
prism software and the results were supplied in the form of analysed histogram 
traces. 

Detection of specific protein targets by SDS-PAGE and Western Blotting 

25 

To obtain cell lysates monolayers of cells were rinsed 3 times with ice-cold PBS 
supplemented with 2 mM CaC^. Cells were incubated with 1 ml/75 cm 2 flask lysis 
buffer (1% v/v NP40, 1% v/v DOC, 0.1 mM PMSF in PBS) for 15 min at 4°C. Cell 
lysates were transferred to eppendorf tubes and passed through a 21 gauge needle to 
30 shear the DNA. This was followed by freeze thawing and subsequent centrifugation 
(30 min, 4°C, 15000g) to remove insoluble material. Protein concentrations of the 
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supernatants were determined using a commercial protein assay (Biorad). Samples 
were prepared for SDS-PAGE by adding 6 times Laemmli electrophoresis sample 
buffer and boiling for 5 min. After electrophoresis with 16 jig of protein on a 10% 
polyacrylamide gel (Laemmli, 1970) the proteins were transferred to PVDF 
5 membrane. The blots were washed with PBS and 0.05% Tween (PBS-T). Blocking 
of the blots occurred in 5% milk powder in PBS-T (60 min, at RT). Blots were 
incubated with the appropriate primary antibody. Horseradish peroxidase labelled 
secondary antibody was used to visualise antibody binding by ECL (Amersham, 
Bucks., UK). Materials used for SDS-PAGE and western blotting were obtained from 
10 Biorad (California, USA) unless stated otherwise. 



Table 1: Antibodies used to detect stem cell differentiation 



Antibody 


Class 


Species 


Cell 

phenotype 
detected 


Changes on 

Differentiatio 

n 


Reference 


TRA-1- 
60 


IgM 


Mouse 


Human EC, 
ES cells. 


I 

differentiation 


Andrews et.al., 
1984a 


TRA-1- 
81 


IgM 


Mouse 


Human EC, 
ES cells. 


I 

differentiation 


Andrews et. 
al.,1984a 


SSEA3 


IgM 


Rat 


Human EC, 
ES cells. 


I 

differentiation 


Shevinsky et al 
1982, Fenderson 
etall987 


SSEA4 


IgG 


Mouse 


Human EC, 
ES cells. • 


differentiation 


Kannagi et al 
1983 Fenderson 
etall987 


A2B5 


IgM 


Mouse 




t 

differentiation 


Fenderson et al 
1987 


ME311 


IgG 


Mouse 




t 

differentiation 


Fenderson et al 
1987 


VIN-IS- 
56 


IgM 


Mouse 




t 

differentiation 


Andrews et al 
1990 


. VTN-IS- 
53 


IgG 


Mouse 




f 

differentiation 


Andrews et al 
1990 















15 

Table 2: Probes used to assess mRNA markers of differentiation 

29 



WO 03/012082 



PCT/GB02/03409 



Gene 


Cell Type 


Synaptophysin 


Neuron 


NeuroDl 


Neuron 


MyoDl 


Muscle 


Collagens 


Cartlidge 


Alpha-actin 


Skeletal muscle 


Smooth-muscle actin 


Smooth muscle 



5 



10 

Table 3: Protein markers of differentiation, detected by Western Blot and/or 
immunofluorescence. 

15 The following antibodies were detected by the appropriate commercially 
available antibodies 



Cell Type 


Antigen 


Neurons 


Neurofilaments 


Glial cells 


GFAP 


Epithelial cells 


Cytokeratins 


Mesenchymal cells 


Vimentin 


Muscle 


Desmin 


Muscle 


Tissue specific actios 


Connective tissue cells 


Collagens ' 
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Table 4: Specific Primers used to generate Stem Loop RNA for gene specific 
inhibition 



5 All sequences written 5' to 3 9 





Gene Name 


Accession 
number 


PCR primer Sequences 


Position 


Notch Pathway 


Ligands: 










Dll-1 


AF003522 








DID 


NM_016941 








D114 


NM_019454 








Dlk-1 


NM_003836 








Jaggedl 


U73936 








Jagged2 


NM_002226 






Receptors: 










Notchl 


M73980 


gcggccgcctttgtggttctgttc 
gccggcgcgtcctcctcttcc 


5224-5726 




Notch2 


In-house 
sequence 


gccagaatgatgctacctgt 
tagagcagcaccaatggaac 






Notch3 


U97669 


Aagttacccccaagaggcaagtgtt 
Aaggaaatgagaggccagaagga 

ga 


7013-7348 




Notch4 


U95299 


ggctgcccctcccactctcg 
cagcccgggccccaggatag 


3727-4132 


Downstream: 










TLE-1 


NM_005077 








TLE-2 


M99436 








TLE-3 


M99438 








TLE-4 


M99439 
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TCF7 


NM_003202 








TCFFL2 


Y11306 








TCF3 


M31523 








TCF19 


NM_007109 








TCF1 


NM_000545 








infringe 


NM_002405 








lfringe 


U94354 








rFringe 


AF1 08139 








Sell 


AF157516 








Numb 


NM_003744 








LNX 


NM_0 10727 






Wingless Pathway 








Ligands 










Wntl 


NM_005430 








Wnt2 


NM_003391 








Wnt2B 


NM_004185 


tgagtggttcctgtactctg 
actcacactgggtaacacgg 


1159-1503 




Wnt5A 


L20861 








Wnt6 


AF079522 








Wnt7A 


NM_004625 








Wnt8B 


NM_003393 








WntlOB 


NM_003394 








Wntll 


NM_004626 








Wntl4 


AF028702 








Wntl5 


AF028703 








Wntl6 


AJbl 69963 






Receptors 










FZD1 


NMJW3505 








FZD2 


NM 001466 


tacccagagcggcctatcattttt 


955-1439 
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acgaagccggccaggaggaagga 
c 






FZD3 


NM_017412 








FZD4 


NM.012193 








FZD5 


NM_003468 








FZD6 


NM_003506 


Tggcctgaggagcttgaatgtgac 
Atcgcccagcaaaaatccaatgaa 


607-1026 




FZD7 


NM_003507 








FZD8 


AA481448 








FZD9 


NM_003508 








FZD10 


NM_007197 








FRZB 


NM_001463 






Extracellular 
Effectors 












SFRP1 


NM_003012 








SFRP2 


AF017986 








SFRP4 


AF026692 


agaggagtggctgcaatgaggtc 
gcgcccggctgttttctt 


877-1178 




SFRP5 


NM_003015 








SK 


AB020315 








CER1 


NM_005454 








WIF-1 


NM_007191 








DVL1 


U46461 








DVL2 


NM_004422 








DVL3 


NM_0U4423 






T , r , €iTici*i*intinn 1 


factors 










Oct4 


Z11899 








Brachyury 


NMJJ03181 
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NeuroDl 


NM_002500 








NeuroD2 


NM_006160 








NeuroD3 


U63842 








MyoD 


NM_002478 








MDFI 


NM_005586 








REST 


NM_005612 



























Table 5 

Listed are examples of vector systems that are to be used in cells to direct the 
production of stem loop RNA, 



Expression System 


Vectors 


Accession numbers 


Promoters 


Tet-on/Tet-off 

Clontech, USA 


pTet-on 
pTet-off 
pTRE2-Hyg 


U89930 
U89929 


CMV 

MyoDl 

NeuroDl 

Oct4 

GATAl 

Beta-actin 

PGK 


IRES 

Invitrogen, 
Nethelands) 


pIRES-EGFP 




CMV 

MyoDl 

NeuroDl 

Oct4 

GATAl 

Beta-actin 

PGK 


Ecdysone 

Invitrogen, 
Netherlands 


pIND 
pVgRXR 
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CLAIMS 

1 . A method to modulate the differentiation state of a stem cell comprising: 

i) contacting a stem cell with at least one nucleic acid molecule comprising a 
5 sequence of a gene which mediates at least one step in the differentiation of said cell 
which nucleic acid molecule consists of a first part linked to a second part wherein 
said first and second parts are complementary over at least part of their length and 
further wherein said first and second parts form a double stranded region by 
complementary base pairing over at least part of their length; 
10 (ii) providing conditions conducive to the growth and differentiation of the cell 
treated in (i) above; and optionally 

(hi) maintaining and/or storing the cell in a differentiated state. 

2. A method according to Claim 1 wherein said first and second parts are linked 
15 by at least one nucleotide base. 

3 A method according to Claim 1 or 2 wherein said nucleic acid molecule is a 
stem loop RNA molecule or a nucleic acid molecule or a vector encoding said stem 
loop RNA. 

20 

4. A method according to any of Claims 1-3 wherein said conditions are in vitro 
cell culture conditions. 

5. A method according to any of Claims 1-4 wherein said stem cell is selected 
25 from the group consisting of: an embryonic stem cell; an embryonic germ cell; an 

embryonal carcinoma cell; a haemopoietic stem cell; a muscle stem cell; a nerve 
stem cell; a skin dermal sheath stem cell; a liver stem cell; a teratocarcinoma cell. 

6. A method according to any of Claims 1-5 wherein said stem cell is an 
30 embryonic stem cell or embryonic germ cell. 
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7. A method according to any of Claims 1-6 wherein said nucleic acid molecule 
is derived from at least one nucleic acid sequence as represented by Figures 4- 54. 

8. A RNA molecule derived from a coding sequence of at least one gene 
5 involved in stem cell differentiation comprising a first part linked to a second part 

wherein said first and second parts are complementary over at least part of then- 
length and further wherein said first and second parts form a double stranded region 
by complementary base pairing over at least part of their length. 

10 9. A RNA molecule according to Claim 8 wherein said first and second parts 
are linked by at least one nucleotide base (nb). 

10. A RNA molecule according to Claim 9 wherein said first and second parts 
are linked by 2, 3, 4, 5, 6, 7, 8, 9, or lOnb in length. 

15 

11. A RNA molecule according to Claim 9 wherein said linker is at least lOnb in 
length. 

12. A RNA molecule according to any of Claims 8-11 wherein the length of the 
20 RNA molecule is between 1 Onb -1 OOOnb in length. 

13. A RNA molecule according to Claim 12 wherein the length of the RNA 
molecule is selected from lOnb; 20nb; 30nb; 40nb; 50nb; 60nb; 70nb; 80nb; 90nb in 
length. 

25 

14. A RNA molecule according to Claim 12 wherein said RNA molecule is 
lOOnb; 200nb; 300nb; 400nb; 500nb; 600nb; 700nb; 800nb; 900nb; or lOOOnb in 
length. 

30 15. A RNA molecule according to Claim 8 wherein said RNA molecule is at 
least 1 OOOnb in length. 
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16. A RNA molecule according to Claim 8 wherein said RNA molecule is 

21nb in length. 

5 17. A RNA molecule according to any of Claims 8-16 wherein said RNA 

molecule comprises sequences identified in Figures 4-54. 

18. A RNA molecule according to any of Claims 8-17 wherein said RNA 
molecules comprise modified nucleotide bases. 

10 

19. A nucleic acid molecule which encodes an RNA molecule according to any of 
Claims 8-18 wherein said nucleic acid molecule is operably linked to at least one 
further nucleic acid molecule capable of promoting transcription of said nucleic acid 
linked thereto. 

15 

20. A nucleic acid molecule according to Claim 19 wherein said further nucleic 
acid molecule is a promoter capable of inducible transcription. 

21. A vector including a nucleic acid molecule according to Claim 19 or 20. 

20 

22. A cell transfected with an RNA molecule according to any of Claims 8-18, 
nucleic acid molecule according to Claim 19 or 20 or a vector according to Claim 
21. 

25 23. A cell according to Claim 22 wherein said cell is an embryonic stem cell or 
embryonic germ cell. 

24. A cell according to Claim 22 wherein said cell is an embryonal carcinoma 
cell. 

30 

25. A method to manufacture stem loop RNA molecules comprising: 
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(i) providing a nucleic acid molecule according to Claim 19 or 20 or a vector 
according to Claim 21; 

5 (ii) providing reagents and conditions which allow the synthesis of the RNA 
molecule comprising a RNA molecule according to any of Claims 8-18; and 

(iii) providing conditions which allow the RNA molecule to base pair over at least 
part of its length, or at least that part corresponding to the nucleic acid sequence 
10 encoding said stem cell gene which mediates stem cell differentiation. 

26. An in vivo method to promote the differentiation of stem cells comprising 
administering to an animal an effective amount of an RNA molecule according to any 
of Claims 8-18, a nucleic acid molecule according to Claim 19 or 20 or a vector 

1 5 according to Claim 2 1 , sufficient to effect differentiation of a target stem cell. 

27. A RNA molecule according to any of Claims 8-18, a nucleic acid molecule 
according to Claim 19 or 20 or a vector according to Claim 21 for use as a 
pharmaceutical. 

20 

28. A pharmaceutical composition comprising a RNA molecule according to any 
of Claims 8-18, a nucleic acid molecule according to Claim 19 or 20 or a vector 
according to Claim 21. 

25 29. Use of a RNA molecule according to any of Claims 8-18, a nucleic acid 
molecule according to Claim 19 or 20 or a vector according to Claim 21 for the 
manufacture of a medicament for use in promoting the differentiation of stem cells to 
provide differentiated cells/tissues to treat diseases where cell/tissues are destroyed 
by said disease. 

30 
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30 Use according to Claim 29 wherein said disease is selected from the group 
consisting of: pernicious anemia; stroke, neurodegenerative diseases such as 
Parkinson's disease, Alzhiemer's disease; coronary heart disease; cirrhosis; 
diabetes; nerves damaged as a consequence of trauma (e.g. replacement of spinal 
5 cord tissue). 

31. Acell obtainable by the method according to anyof Claims 1-7. 

32. An organ comprising at least one cell according to Claim 3 1 . 

10 

33 . A non-human transgenic animal comprising a RNA molecule according to 
any of Claims 8-18, or a nucleic acid molecule according to Claim 19 or 20, or a 
vector according to Claim 21. 



20 
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30 
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Figure 2 



AGCAGCTTGGGCTCGAGAAGGATGTGGTCCGAGTGTGGTTCTGTAACCGGCGCCAG 

AAGGGCAAGCGATCAAGCAGCGACTATGCACAACGAGAGGATTTTGAGGCTGCTGG 

GTCTCCTTTCTCAGGGGGACCAGTGTCCTTTCCTCTGGCCCCAGGGCCCCATTTTGGT 

GCCCCAGGCTATGGGAGCCCTCACTTCACTGCACTGTACTCCTCGGTCCCTTTCCCTG 

AGGGGGAAGCCTTTCCCCCTGTCTCTGTCACCACTCTGGGCTCTCCCTTGCATTCAAA 

CTGAGGTGCCTGCCTGCCCTTCTAGGAATGGGGGACAGGGGGAGGGGAGGAGCTAG 

GGAAAGAAAACCTGGAGTTTGTGCCAGGGTTTTTGGATTAAGTTCTTCATTCACTAA 

GGAAGGAATTGGGAACACAAAGGG 
3' 
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Figure 3 

GTCCAGCGGTACCATGGGCCGTCGGAGCGCGCTAGCCCTTGCCGTGGTCTCTGCCCTGCTGTGC 

CAGGTCTGGAGCTCCGGCGTATTTGAGCTGAAGCTGCAGGAGTTCGTCAACAAGAAGGGGCTG 

CTGGGGAACCGCAACTGCTGCCGCGGGGGCTCTGGCCCGCCTTGCGCCTGCAGGACCTTCTTTC 

GCGTATGCCTCAAGCACTACCAGGCCAGCGTGTCACCGGAGCCACCCTGCACCTACGGCAGTG 

CTGTCACGCCAGTGCTGGGTGTCGACTCCTTCAGCCTGCCTGATGGCGCAGGCATCGACCCCGC 

CTTCAGCAACCCCATCCGATTCCCCTTCGGCTTCACCTGGCCAGGTACCTTCTCTCTGATCATTG 

AAGCCCTCCATACAGACTCTCCCGATGACCTCGCAACAGAAAACCCAGAAAGACTCATCAGCC 

GCCTGACCACACAGAGGCACCTCACTGTGGGAGAAGAATGGTCTCAGGACCTTCACAGTAGCG 

GCCGCACAGACCTCCGGTACTCTTACCGGTTTGTGTGTGACGAGCACTACTACGGAGAAGGTTG 

CTCTGTGTTCTGCCGACCTCGGGATGACGCCTTTGGCCACTTCACCTGCGGGGACAGAGGGGAG 

AAGATGTGCGACCCTGGCTGGAAAGGCCAGTACTGCACTGACCCAATCTGTCTGCCAGGGTGT . 

GATGACCAACATGGATACTGTGACAAACCAGGGGAGTGCAAGTGCAGAGTTGGCTGGCAGGGC 

CGCTACTGCGATGAGTGCATCCGATACCCAGGTTGTCTCCATGGCACCTGCCAGCAACCCTGGC 

AGTGTAACTGCCAGGAAGGCTGGGGGGGCCTTTTCTGCAACCAAGACCTGAACTACTGTACTCA 

CCATAAGCCGTGCAGGAATGGAGCCACCTGCACCAACACGGGCCAGGGGAGCTACACATGTTC 

CTGCCGACCTGGGTATACAGGTGCCAACTGTGAGCTGGAAGTAGATGAGTGTGCTCCTAGCCCC 

TGCAAGAACGGAGCGAGCTGCACGGACCTTGAGGACAGCTTCTCTTGCACCTGCCCTCCCGGCT 

TCTATGGCAAGGTCTGTGAGCTGAGCGCCATGACCTGTGCAGATGGCCCTTGCTTCAATGGAGG 

ACGATGTTCAGATAACCCTGACGGAGGCTACACCTGCCATTGCCCCTTGGGCTTCTCTGGCTTC 

AACTGTGAGAAGAAGATGGATCTCTGCGGCTCTTCCCCTTGTTCTAACGGTGCCAAGTGTGTGG 

ACCTCGGCAACTCTTACCTGTGCCGGTGCCAGGCTGGCTTCTCCGGGAGGTACTGCGAGGACAA 

TGTGGATGACTGTGCCTCCTCCCCGTGTGCAAATGGGGGCACCTGCCGGGACAGTGTGAACGAC 

TTCTCCTGTACCTGCCCACCTGGCTACACGGGCAAGAACTGCAGCGCCCCTGTCAGCAGGTGTG 

AGCATGCACCCTGCCATAATGGGGCCACCTGCCACCAGAGGGGCCAGCGCTACATGTGTGAGT 

GCGCCCAGGGCTATGGCGGCCCCAACTGCCAGTTTCTGCTCCCTGAGCCACCACCAGGGCCCAT 

GGTGGTGGACCTCAGTGAGAGGCATATGGAGAGCCAGGGCGGGCCCTTCCCCTGGGTGGCCGT 

GTGTGCCGGGGTGGTGCTTGTCCTCCTGCTGCTGCTGGGCTGTGCTGCTGTGGTGGTCTGCGTCC 

GGCTGAAGCTACAGAAACACCAGCCTCCACCTGAACCCTGTGGGGGAGAGACAGAAACCATGA 

ACAACCTAGCCAATTGCCAGCGCGAGAAGGACGTTTCTGTTAGCATCATTGGGGCTACCCAGAT 

CAAGAACACCAACAAGAAGGCGGACTTTCACGGGGACCATGGAGCCAAGAAGAGCAGCTTTA 

AGGTCCGATACCCCACTGTGGACTATAACCTCGTTCGAGACCTCAAGGGAGATGAAGCCACGG 

TCAGGGATACACACAGCAAACGTGACACCAAGTGCCAGTCACAGAGCTCTGCAGGAGAAGAG 

AAGATCGCCCCAACACTTAGGGGTGGGGAGATTCCTGACAGAAAAAGGCCAGAGTCTGTCTAC 

TCTACTTCAAAGGACACCAAGTACCAGTCGGTGTATGTTCTGTCTGCAGAAAAGGATGAGTGTG 

TTATAGCGACTGAGGTGTAAGATGGAAGCGATGTGGCAAAATTCCCATTTCTCTCAAATAAAAT 

TCCAAGGATATAGCCCCGATGAATGCTGCTGAGAGAGGAAGGGAGAGGAAACCCAGGGACTG 

CTGCTGAGAACCAGGTTCAGGCGAAGCTGGTTCTCTCAGAGTTAGCAGAGGCGCCCGACACTG 

CCAGCCrAGGCTTTGG.CTGCCGCTGGACTGCCTGCTGGTTGTTCCCATTGCACT 

CTTTGAAGAGTATATATTTAAATGGACGAGTGACTTGATTCATATAGGAAGCACGCACTGCCCA 

CACGTCTATCITGGATTACTATGAGCCAGTCTTTCCTTGAACTAGAAACACAACrGCCTTTA 

TCCTITITGATACrrGAGATGTGTTTITITITITCCTAGACGGGA 

TTTrGGGATTTGTAAAAATATTTTTCATGATATCTGTAAAGCTTGAGTATTT^ 

TTTTrATAATTTAAATTTTGGTAAATATGTACAAAGGCACTTCGGGTCTATGTGACTATA 

TTGTATATAAATGTATTTATGGAATATTGTGG 

GAAGAAATTCATTTTAAAAATATTITrCCAAAATAAATATAATGAACTACA 



Figure 4 

CGGGCAGAGGTGGAAGAGGGGGGAGCGCCTCAAAGAAGCGATCAGAATAATAAAAGGAGGCC 

GGGCTCTrTGCCTTCTGGAACGCGCGGCTCTTGAAAGGGCTTTTGAAAAGTAGTGTTGTTTTCC 

GTCGTGCATGCTCCAATCCACGGAGTATATTAGAGCCGGGACGCGGCGGCCGCGGGGGCAGCG 

ACGACGGCAGCCTCGGCGGGAGCACCAGCGCTAGCAGCGGCGGCGGCGTCCGGAGTGCCCGTG 

GCGCGCGGCGCAGCGATGCGGTCCCCACGGACGCGCGGCCGGCCCGGGCGCCCCCTGAGTCTT 
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CTGCTCGCCCTGCTCTGTGCCCTGCGAGCCAAGGTGTGCGGGGCCTCGGGTCAGTTTGAGCTGG 

AGATCCTGTCCATGCAGAACGTGAATGGAGAGCTACAGAATGGGAACTGTTGTGGTGGAGTCC 

GGAACCCTGGCGACCGCAAGTGCACCCGCGACGAGTGTGATACGTACTTCAAAGTGTGCCTCA 

AGGAGTATCAGTCCCGCGTCACTGCCGGGGGACCCTGCAGCTTCGGCTCAGGGTCTACGCCTGT 

CATCGGGGGTAACACCTTCAATCTCAAGGCCAGCCGTGGCAACGACCGTAATCGCATCGTACTG 

CCTTTCAGTTTCGCCTGGCCGAGGTCCTAeACTTTGCTGGTGGAGGCCTGGGATTCCAGTAATG 

ACACTATTCAACCTGATAGCATAATTGAAAAGGCTTCTCACTCAGGCATGATAAACCCTAGCCG 

GCAATGGCAGACACTGAAACAAAACACAGGGATTGCCCACTTCGAGTATCAGATCCGAGTGAC 

CTGTGATGACCACTACTATGGCTTTGGCTGCAATAAGTTCTGTCGTCCCAGAGATGACTTCTTTG 

GACATTATGCCTGTGACCAGAACGGCAACAAAACTTGCATGGAAGGCTGGATGGGTCCTGATT 

GCAACAAAGCTATCTGCCGACAGGGCTGCAGTCCCAAGCATGGGTCTTGTAAACTTCCAGGTG 

ACTGCAGGTGCCAGTACGGTTGGCAGGGCCTGTACTGCGACAAGTGCATCCCGCACCCAGGAT 

GTGTCCACGGCACCTGCAATGAACCCTGGCAGTGCCTCTGTGAGACCAACTGGGGTGGACAGC 

TCTGTGACAAAGATCTGAATTACTGTGGGACTCATCAGCCCTGTCTCAACCGGGGAACATGTAG 

CAACACTGGGCCTGACAAATACCAGTGCTCCTGCCCAGAGGGCTACTCGGGCCCCAACTGTGA 

AATTGCTGAGCATGCTTGTCTCrCTGACCCCTGCCATAACCGAGGCAGCTGCAAGGAGACCTCC 

TCAGGCTTTGAGTGTGAGTGTTCTCCAGGCTGGACTGGCCCCACGTGTTCCACAAACATCGATG 

ACTGTTCTCCAAATAACTGTTCCCATGGGGGCACCTGCCAGGATCTGGTGAATGGATTCAAGTG 

TGTGTGCCCGCCCCAGTGGACTGGCAAGACTTGTCAGTTAGATGCAAATGAGTGCGAGGCCAA 

ACCTTGTGTAAATGCCAGATCCTGTAAGAATCTGATTGCCAGCTACTACTGTGATTGCCTTCCTG 

GCTGGATGGGTCAGAACTGTGACATAAATATCAATGACTGCCTTGGCCAGTGTCAGAATGACG 

CCTCCTGTCGGGATTTGGTTAATGGTTATCGCTGTATCTGTCCACCTGGCTATGCAGGCGATCAC 

TGTGAGAGAGACATCGATGAGTGTGCTAGCAACCCCTGCTTGAATGGGGGTCACTGTCAGAAT 

GAAATCAACAGATTCCAGTGTCTCTGTCCCACTGGTITCTCTGGAAACCTCTGTCAGCTGGACA 

TCGATTACTGCGAGCCCAACCCTTGCCAGAATGGCGCCCAGTGCTACAATCGTGCCAGTGACTA 

TTTCTGCAAGTGCCCCGAGGACTATGAGGGCAAGAACTGCTCACACCTGAAAGACCACTGCCG 

TACCACCACCTGCGAAGTGATTGACAGCTGCACTGTGGCCATGGCCTCCAACGACACGCCTGAA 

GGGGTGCGGTATATCTCTTCTAACGTCTGTGGTCCCCATGGGAAGTGCAAGAGCCAGTCGGGAG 

GCAAATTCACCTGTGACTGTAACAAAGGCTTCACCGGCACCTACTGCCATGAAAATATCAACGA 

CTGCGAGAGCAACCCCTGTAAAAACGGTGGCACCTGCATCGATGGCGTTAACTCCTACAAGTGT 

ATCTGTAGTGACGGCTGGGAGGGAGCGCACTGTGAGAACAACATAAATGACTGTAGCCAGAAC 

CCTTGTCACTACGGGGGTACATGTCGAGACCTGGTCAATGACITITACTGTGACTGCAAAAATG 

GCTGGAAAGGAAAGACTTGCCATTCCCGTGACAGCCAGTGTGACGAAGCCACGTGTAATAATG 

GTGGTACCTGCTATGATGAAGTGGACACGTTTAAGTGCATGTGTCCCGGTGGCTGGGAAGGAA 

CAACTTGTAATATAGCTAGAAACAGTAGCTGCCTGCCGAACCCCTGTCATAATGGAGGTACCTG 

CGTGGTCAATGGAGACTCCTTCACCTGTGTCTGCAAAGAAGGCTGGGAGGGGCCTATTTGTACT 

CAAAATACCAACGACTGCAGTCCCCATCCTTGTTACAATAGCGGGACCTGTGTGGACGGAGAC 

AACTGGTATCGGTGCGAATGTGCCCCGGGTTTTGCTGGGCCAGACTGCAGGATAAACATCAATG 

AGTGCCAGTCTTCCCCITGTGCCTTTGGGGCCACCTGTGTGGATGAGATCAATGGCTACCAGTG 

TATCTGCCCTCCAGGACATAGTGGTGCCAAGTGCCATGAAGTTTCAGGGCGATCTTGCATCACC 

ATGGGGAGAGTGATACTrGATGGGGCCAAGTGGGATGATGACTGTAACACCTGCCAGTGCCTG 

AATGGACGGGTGGCCTGCTCCAAGGTCTGGTGTGGCCCGAGACCTTGCAGGCTCCACAAAAGC 

CACAATGAGTGCCCCAGTGGGCAGAGCTGCATCCCGGTCCTGGATGACCAGTGTTTCGTGCGCC 

CCTGCACTGGTGTTGGCGAATGTCGGTCCTCCAGCCTCCAGCCAGTGAAGACCAAGTGCACATC 

TGACTCCTATTACCAGGATAACTGTGCAAACATCACTTrCACCTTTAACAAAGAGATGATGTCT 

CCAGGTCTTACCACCGAACACATTTGCAGCGAATTGAGGAATTTGAATATCCTGAAGAATGTTT 

CTGCTGAATATrCGATCTACATAGCCTGTGAGCCTTCCCTGTCAGCAAACAATGAAATACACGT 

GGCCATCTCTGCAGAAGACATCCGGGATGATGGGAACCCTGTCAAGGAAATTACCGATAAAAT 

AATAGATCTCGTTAGTAAACGGGATGGAAACAGCTCACTTATTGCTGCGGTTGCAGAAGTCAG 

AGTTCAGAGGCGTCCTCTGAAAAACAGAACAGATTTCCTGGTTCCTCTGCTGAGCTCTGTCTTA 

ACAGTGGCTTGGGTCTGTTGCTTGGTGACAGCCTTCTACTGGTGTGTACGGAAGCGGCGGAAGC 

CCAGCAGCCACACTCACTCCGCCCCCGAGGACAACACCACCAACAATGTGCGGGAGCAGCTGA 

ACCAAATCAAAAACCCCATCGAGAAACACGGAGCCAACACGGTCCCCATTAAGGATTACGAGA 

ACAAAAACTCCAAAATGTCAAAAATCAGGACACACAACTCGGAAGTGGAGGAGGATGACATG 

GATAAACACCAGCAGAAAGTCCGCTTTGCCAAACAGCCAGTGTATACGCTGGTAGACAGAGAG 

GAGAAGGCCCCCAGCGGCACGCCGACAAAACACCCGAACTGGACAAATAAACAGGACAACAG 
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AGACTTGGAAAGTGCCCAGAGCTrGAACCGGATGGAATACATCGTATAGCAGACAGTGGGCT 
CCGCCATAGGTAGAGTTTGAGGGCACCGCGGGCCG 



Figure 5 



CTGCGGCCGGCCCGCGAGCTAGGCTGGGTTTTTTTTTTTCTC^ 

TGATCTAAAAGGGAATAA,AAGGCTGCGCATAATCATAATAATAA^GAAGGGGAGCGCGAGAGAA.GGA 
GAAAGCCGGGAGGTGGAAGAGGAGGGGGAGCGTCTCAAAGAAGCGATCAGAATAATAAAAGGAGGCGG 

CTCTTTGCCITCTCX^ 

TGCTCCAATCGGCGGAGTATATTAGAGCCGGGACGCGGCGGCCGCAGGGGCAGCGGCGACGGCAGCACG 
GCGGCAGCACCAGCGCGAACAGCAGCGGCGGCGTCCCGAGTGCCCGCGGCGCGCGGCGCAGCGATGCGT 
CCCCACGGACGCGCGGCCGGTCCGGGCGCCCCCTAAGCCTCCTGCTCGCCCTGCTCTGTGCCCTGCGAGC 

CAAGGTGTGTGGGGCCTCGGGTCAGTTCGAGTTGGAGATCCTGTCCATGCAGAACGTGAACGGGGAGCTG 

CAGAACGGGAACTGCTGCGGCGGCGCCCGGAACCCGGGAGACCGCAAGTGCACCCGCGACGAGTGTGAA 

CATACTTCAAAGTGTGCCTCAAGGAGTATCAGTCCCGCGTCACGGCCGGGGGGCCCTGCAGCTTCGGCTC 

AGGGTCCACGCCTGTCATCGGGGGCAACACCTTCAACCTCAAGGCCAGCCGCGGCAACGACCGCAACCC 

ATCGTGCTGCCTTTCAGTTTCGCCTGGCCGAGGTCCTATACGTTGCTTGTGGAGGCG 

ATGACACCGTTCAACCTGACAGTATTATTGAA 

GTGGCAGACGCTGAAGCAGAACACGGGCGTTGCCCACTTTGAGTATCAGATCCGCGTGACCTGTGATGAC 
TACTACTATGGCTTTGGCTGCAATAAGTTCTGCCGCCCCAGAGATGAOT 

ACCAGAATGGCAACAAAACTTGCATGGAAGGCTGGATGGGCCCCGAATGTAACAGAGCTATTTGCCGAA 

AGGCTGCAGTCCTAAGCATGGGTCTTGCAAACTCCCAGGTGACTGCAGGTGCCAGTATGGCTGGCAAGGC 

CTGTACTGTGATAAGTGCATCCCACACCCGGGATGCGTCCACGGCATCTGTAATGAGCCCTGGCAGTGCC 

TCTGTGAGACCAACTGGGGCGGCCAGCTCTGTGACAAAGATCTCAATTACTGTGGGACTCATCAGCCGTG 

TCTCAACGGGGGAACTTGTAGCAACACAGGCCCTGACAAATATCAGTGTTCCTGCCCTGAGGGGTATTCA 

GGACCCAACTGTGAAATTGCTGAGCACGCCTGCCTCTCTGATCCCTGTCACAACAGAGGCAGCTGTAAGG 

AGACCTCCCTGGGCTTTGAGTGTGAGTGTTCCCCAGGCTGGACCGGCCCCACATGCTCTACAAACATTGA 

TGACTGTTCTCCTAATAACTGTTCCCACGGGGGCACCTGCCA 

TGCCCCCCACAGTGGACTGGGAAAACGTGCCAGTTAGATGCAAATGAATGTGAGGCCAAACCTTGTGTAA 
ACGCCAAATCCTGTAAGAATCTCATTGCCAGCT^ 

TTGTGACATAAATATTAATGACrGCCTTGGCCAGTGTCAGAATGACGCCTCCTGTCGGGA 
GGTTATCGCTGTATCTGTCCACCTGGCTATGCAGGCGATCACTGTGAGAGAGACATCGATGAATGTGCCA 
GCAACCCCTGTTTGAATGGGGGTCACTGTCAGAATGAAATCAACAGATTCCAGTGTCTGTGTCCCACT 
TTTCTCTGGAAACCTCTGTCAGCTGGACATCGATTATTGT^ 

TGCTACAACCGTGCCAGTGACrATTTCTGCAAGTGCCCCGAGGACTATGAGGGCAAGAACTGCTCACACC 

TGAAAGACCACTGCCGCACGACCCCCTGTGAAGTGATTGACAGCTGCACAGTGGCCATGGCTTCCAACGA 

CACACCTGAAGGGGTGCGGTATATTTCCTCCAACGTCTGTGGTCCTCACGGGAAGTGCAAGAGTCAGTCG 

GGAGGCAAATTCACCTGTGACTGTAACAAAGGCTTCACGGGAACATACTGCCATGAAAATATTAATGACT 

GTGAGAGCAACCCTTGTAGAAACGGTGGCACTTGCATCGATGGTGTCAACTCCTACAAGTGCATCTGTAG 

TGACGGCTGGGAGGGGGCCTACTGTGAAACCAATATTAATGACTGCAGCCAGAACCCCTGCCACAATGG 

GGCACGTGTCGCGACCTGGTCAATGACTTCTACTGTGACTGTAAAAATGGGTGGAAAGGAAAGACCT 

ACTCACGTGACAGTCAGTGTGATGAGGCCACGTGCAACAACGGTGGCACCTGCTATGATGAGGGGGATC 

TITrAAGTGCATGTGTCCTGGCGGCTGGGAAGGAACAACCTGTAACATAGCCCGAAACAGTAGCTGCCT 

CCCAACCCCTGCCATAATGGGGGCACATGTGTGGTCAACGGCGAGTCCrrTTACGTGCGTCTGCAAGGAAG 

GCTGGGAGGGGCCCATCTGTGCTCAGAATACCAATGACTGCAGCCCTCATCCCTGTTACAACAGCGGCAC 

CTGTGTGGATGGAGACAACTGGTACCGGTGCGAATGTGCCCCGGGTTTTGCTGGGCCCGACTGCAGAATA 

AACATCAATGAATGCCAGTCITCACCnTGTGCCITrGGAG 

GGTGTGTCTGCCCTCCAGGGCACAGTGGTGCCAAGTGCCAGGAAGTTTCAGGGAGACCTTGCATCACCAT 

GGGGAGTGTGATACCAGATGGGGCCAAATGGGATGATGACTGTAATACCTGCCAGTGCCTGAATGGACG 

ATCGCCTGCTCAAAGGTCTGGTGTGGCCCTCGACCnTGCCTGCTCCACAAAGGGCACAGCGA 

GCGGGCAGAGCTGCATCCCCATCCTGGACGACCAGTGCTTCGTCCACCCCTGCACTGGTGTGGGCGAGTG 

TCGGTCTTCCAGTCTCCAGCCGGTGAAGACAAAGTGCACCTCTGACTCCTATTACCAGGATAACTGTGCG 

AACATCACATTTACCTTTAACAAGGAGATGATGTCACCAGGTCTTACTACGGAGCACATTTG 

TGAGGAATTTGAATATTTTGAAGAATGTTTCCGCTGAATATO 

TTCAGCGAACAATGAAATACATGTGGCCATTTCTGCTGAAGATATACGGGATGATGGGAACCCGATCAAG 
GAAATCACTGACAAAATAATCGATCTrGTTAGTAAACGTGATGGAAACAGCTCGCTGATTGCT 
CAGAAGTAAGAGTTCAGAGGCGGCCTCTGAAGAACAGAACAGAOT 
CTTAACTGTGGCTTGGATCTGTTGCIT 

GGCAGCCACACACACTCAGCCTCTGAGGACAACACCACCAACAACGTGCGGGAGCAGCTGAACCAGATA 
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CGTGTCATACTCGAGTCTGAGGCCGTTGCTGACTTAGAATCCCTGTGTTAATTrAAUl 1 1 i^aauuu 
SS£aGTCAA^^ 

catagttttatttatatttattgactcttgagttgtttttgtatattggttttatgatgacg 

T^AATCTATITITGTTG^ 

ttta^agaaa^ 

^A^TTra^^AACITAATAATCAAGAGCCTTAAAA 

GT^AGAA^G^AGGTTTTTGATAGCATTGTAAGCGTATC 

TGTTGCCTATAAGCCAAAATTAAGGTGTTTGAAAATAGTTTATTTTAAAACAAT 

CCCAGAATACTGATGGAATTTTTTTTGTACGACGTCAGATGTTr^ 
AACACGTTTTAAGGACTGACTGAGGCAGTTTGAGGATTAGTTTAGAACAGGTT^^ 

toag™g^c^cS 

AACTTGGAAGCACACCAATCTGACTTTGTAAATTCTGATTTC^ 

CACTTGTAGATTTGA'r riTrri 'l 1 1 aatctactgcatttagggagtattctaataagctagttgaatact 
tgaaccataa^Stctccagtaagatcactgtttagatttgcca 

GAAATCAAAGTGCTATTACGAAGTTCAAGATCAAAAAGGCTTATAAAACAGAG 

cattoagaccgtgaagatactitgtattgtcctattagtg 

TGTTGTrCTTGGCAATAAATTITGAAAAGTAATAriTATTAAATTITm 
GTGGCTCTrCTGAGCTTACGTAGTTCTACCGGCTITGCCGTGTC 

TGGTAATCGGGGTATAATAGGCTCTGCCTGACAGAGGGATGGAGGAAGAACTGAAAGGCTTITCAACCC 
AAAACTCATCTGGAGTC 

GAGCCTTGATTCCCTTGTTATTCAACAGCAAGTGTGAATACTGCTTGAATAAACACCACTGGATTAATGG 
AAAAAAAAAAAAAAAA 



Figure 6 

GGAGCGGGCGCGCGGCGGCGGCGGGGCCGCGGCGGGCGGGTCGCGGGGGCAATGCGGGCGCAGGGCCG 
GGGCCTTCCCCCCGGCGCTGCTGCTGCTGCTGGCGCTCTGGGTGCAGGCGGCGCGGCCCATGGGCTAm 
CGAGCTGCAGCTGAGCGCGCTGCGGAACGTGAACGGGGAGCTGCTGAGCGGCGCCTGCTGTGACGGCGC 

^cSacaacgcgcgcggggggctgcggccacgac^ 

AGTACCAGGCCAAGGTGACGCCCACGGGGCCCTGCAGCTACGGCCACGGCGCCACGCCCG^ 

caactccttctacctgccgccggcgggcgctgcgggggaccgagcgcgcgcgcggccccgggccggcgc 

GACCAGGACCCGGGCTTCGTCGTCATCCCCTTCCAGTTCGCCTGGCCGCGCTCC^ 

aggcctgggactgggacaacgataccaccccgaatgaggagctgctgatcgagcgagtgtcgcatgccg 

CATGATCAACCCGGAGGACCGCTGGAAGAGCCTGCACTTCAGCGGCCACGTGGCGCACCTGGAGCrcCG 
ATCCGCGTGCGCTGCGACGAGAACTACTACAGCGCCACTTGCAACAAGTC^ 

TTTTCGGCCACTACACCTGCGACCAGTACGGCAACAAGGCCTGCATGGACGGCTGGATGGGCAAGGAGTG 

CAAGGAAGCTGTGTGTAAACAAGGGTGTAATTTGCTCCACGGGGGATGCACCGTGCCTGGGGAGTGCAG 

TGCAGCTACGGCTGGCAAGGGAGGTTCTGCGATGAGTGTGTCCCCTACCCCGGCTGCGTG^ 

GTGTGGAGCCCTGGCAGTGCAACTGTGAGACCAACTGGGGCGGCCTGCTCTGTGACAAAGACCTGAACTA 

CTGTGGCAGCCACCACCCCTGCACCAACGGAGGCACGTGCATCAACGCCGAGCCTGACCAGTACCGCTGC 

ACCTGCCCTGACGGCTACTCGGGCAGGAACTGTGAGAAGGCTGAGCACGCCTGCACCTCCAACCCGTGTG 

CCAACGGGGGCTCTrGCCATGAGGTGCCGTCCGGCTTCGAATGCCACTGCCCATCGGGCTGGAGCGjGGCC 

CACCTGTGCCCTTGACATCGATGAGTGTGCTTCGAACCCGTGTGCGGCCGGTGGCACCTGTGTGGACCAG 

GTGGACGGCTTTGAGTGCATCTGCCCCGAGCAGTGGGTGGGGGCCACCTGCCAGCTGGACGTCAACGACT 

GTGAAGGGAAGCCATGCCTTAACGCirTTTCTTGCAAAAACCKiATTGGCGGCTATTACT 

CCCGGGCTGGAAGGGCATCAACTGCCATATCAACGTCAACGACTGTCGCGGGCAGTGTCAGCATGGGGC 

ACCTGCAAGGACCTGGTGAACGGGTACCAGTGTGTGTGCCCACGGGGCTTCGGAGGCCGGCATTGCGAGC 

TGGAACGAGACAAGTGTGCCAGCAGCCCCTGCCACAGCGGCGGCCTCTGCGAGGACCTGGCCGACGGCT 

CCACTGCCACrGCCCCCAGGGCTTCTCCGGGCCTCTCTGTGAGGTGGATGTCGACCTTTGTGAGCCAAGC 

CCCTGCCGGAACGGCGCTCGCTGCTATAACCTGGAGGGTGACTATTACTGCGCCTGCCCTGATGACTTTG 

GTGGCAAGAACTGCTCCGTGCCCCGCGAGCCGTGCCCTGGCGGGGCCTGCAGAGTGATCGATGGCTGCGG 
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GTCAGACGCGGGGCCTGGGATGCCTGGCACAGCAGCCTCCGGCGTGTGTGGCCCCCATGGACGCTGCGTC 

AGCCAGCCAGGGGGCAACTTTTCCTGCATCTGTGACAGTGGCTTTACTGGCACCTACTGCCATGAGAACA 

TTGACGACTGCCTGGGCCAGCCCTGCCGCAATGGGGGCACATGCATCGATGAGGTGGACGCCTTCCGCTG 

CrTCTGCCCCAGCGGCTGGGAGGGCGAGCTCTGCGACACCAATCCCAACGACTGCCTTCCCGATCCCTGC 

CACAGCCGCGGCCGCTGCTACGACCTGGTCAATGACTTCTACTGTGCGTGCGACGACGGCTGGAAGGGCA 

AGACCTGCCACTCACGCGAGTTCCAGTGCGATGCCTACACCTGCAGCAACGGTGGCACCTGCTACGACAG 

CGGCGACACCTTCCGCTGCGCCTGCCCCCCCGGCTGGAAGGGCAGCACCTGCGCCGTCGCCAAGAACAGC 

AGCTGCCTGCCCAACCCCTGTGTGAATGGTGGCACCTGCGTGGGCAGCGGGGCCTCCTTCTCCTGCATCT 

GCCGGGACGGCTGGGAGGGTCGTACTTGCACrCACAATACCAACGACTGCAACCCTCTGCCTTGCTACAA 

TGGTGGCATCTGTGTTGACGGCGTCAACTGGTTCCGCTGCGAGTGTGCACCTGGCTTCGCGGGGCCTGAC 

TGCCGCATCAACATCGACGAGTGCCAGTCCTCGCCCTGTGCCTACGGGGCCACGTGTGTGGATGAGATCA 

ACGGGTATCGCTGTAGCTGCCCACCCGGCCGAGCCGGCCCCCGGTGCCAGGAAGTGATCGGGTTCGGGAG 

ATCCTGCTGGTCCCGGGGCACTCCGTTCCCACACGGAAGCTCCTGGGTGGAAGACTGCAACAGCTGCCGC 

TGCCTGGATGGCCGCCGTGACTGCAGCAAGGTGTGGTGCGGATGGAAGCCTTGTCTGCTGGCCGGCCAGC 

CCGAGGCCCTGAGCGCCCAGTGCCCACTGGGGCAAAGGTGCCTGGAGAAGGCCCCAGGCCAGTGTCTGG 

ACCACCCTGTGAGGCCTGGGGGGAGTGCGGCGCAGAAGAGCCACCGAGCACCCCCTGCCTGCCACGCTC 

GGCCACCTGGACAATAACTGTGCCCGCCTCACCTTGCATTTCAACCGTGACCACGTGCCCCAGGGCACCA 

CGGTGGGCGCCATTTGCTCCGGGATCCGCTCCCTGCCAGCCACAAGGGCTGTGGCACGGGACCGCCTGCT 

GGTGTTGCTTTGCGACCGGGCGTCCTCGGGGGCCAGTGCCGTGGAGGTGGCCGTGTCCTTCAGCCCTGCC 

AGGGACCTGCCTGACAGCAGCCTGATCCAGGGCGCGGCCCACGCCATCGTGGCCGCCATCACCCAGCGG 

GGAACAGCTCACTGCTCCTGGCTGTCACCGAGGTCAAGGTGGAGACGGTTGTTACGGGCGGCTCTTCCAC 

AGGTCTGCTGGTGCCTGTGCTGTGTGGTGCCTTCAGCGTGCTGTGGCTGGCGTGCGTGGTCCTGTGCGTG 

TGGTGGACACGCAAGCGCAGGAAAGAGCGGGAGAGGAGCCGGCTGCCGCGGGAGGAGAGCGCCAACAC 

AGTGGGCCCCGCTCAACCCCATCCGCAACCCCATCGAGCGGCCGGGGGGCCACAAGGACGTGCTCTACCA 

GTGCAAGAACTTCACGCCGCCGCCGCGCAGGGCGGACGAGGCGCTGCCCGGGCCGGCCGGCCACGCGGC 

GTCAGGGAGGATGAGGAGGACGAGGATCTGGGCCGCGGTGAGGAGGACTCCCTGGAGGCGGAGAAGTTC 

TCTCACACAAATTCACCAAAGATCCTGGCCGCTCGCCGGGGAGGCCGGCCCACTGGGCCTCAGGCCCCAA 

AGTGGACAACCGCGCGGTCAGGAGCATCAATGAGGCCCGCTACGCCGGCAAGGAGTAGGGGCGGCTGCG 

CTGGGCCGGGACCCAGGGCCCTCGGTGGGAGCCATGCCGTCTGCCGGACCCGGAGCCGAGGCATGTGCT 

AGTTTCTrTTATTTTGTGTAAAAAAACCACCAAAAACAA 

CCITGTATAAATTATTCAGTAACTGTC^^ 

TAAAGTrTCCGTGCGTGGCACTCGCTGTATGAAAGGAGAGAGCAAAGGGTGTCTGCGTCGTCACCAAATC 
GTAGCGTTTGTTACCAGAGGTTGTGCACTGTTTACAGAATC^ 

GTGGCTCCAGGCCAAAGTGCCGGTGAGACCCATGGCTGTGTTGGTGTGGCCCATGGCTGTTGGTGGGACC 

CGTGGCTGATGGTGTGGCCTGTGGCTGTCGGTGGGACTCGTGGCTGTCAATGGGACCTGTGGCTGTCGGT 

GGGACCTACGGTGGTCGGTGGGACCCTGGTTATTGATGTGGCCCTGGCTGCCGGCACGGCCCGTGGCTGT 

TGACGCACCTGTGGTTGTTAGTGGGGCCTGAGGTCATCGGCGTGCCCAAGGCCGGCAGGTCAACCTCGGG 

CTTGCTGGCCAGTCCACCCTGCCTGCCGTCTGTGCTTCCTCCTGCCCAGAACGCCCGCTCCAGCGATCTC 

TCCACTGTGCTTTCAGAAGTGCCCTTCCTGCTGCGCAGTTCTCCCATCCTGGGACGGCGGCAGTATTGAA 

GCTCGTGACAAGTGCCTTCACACAGACCCCTCGCAACTGTCCACGCGTGCCGTGGCACCAGGCGCTGCCC 

ACCTGCCGGCCCCGGCCGCCCCTCCTCGTGAAAGTGCATTTTTGTAAATGTGTACATATTAAAGGAAGCA 
CTCTGTATATTTGATTGAATAATGCCACCAAAAAAAAAAAAAAAAAAAAATTCCTGCCC 



Figure 7 

TCGAGGCGGCGATGCGGGCACGCGGCTGGGGACGCCTGCCTCGGCGGCTGCTGCTGCTACTGG 

TTCTGTGCGTGCAGGCGACGCGGCCCATGGGCTATTTCGAGCTGCAGCTGAGCGCGCTGCGGAA 

CGTGAACGGGGAGCTGCTGAGCGGCGCCTGCTGTGACGGCGACGGCCGGACGACGCGCGCGGG 

GGGCTGCGGCCGCGACGAGTGCGACACGTACGTGCGCGTGTGCCTTAAGGAGTACCAGGCCAA 

GGTGACGCCCACGGGGCCCTGCAGCTACGGCTACGGCGCCACGCCCGTGCTGGGTGGCAACTC 

CTTCTACCTGCCGCCGGCGGGCGCTGCGGGGGACCGAGCGCGCGCGCGGTCTCGGACCGGCGG 

CCACCAGGACCCGGGCCTCGTCGTCATTCCCTTTCAGTTCGCCTGGCCGCGTTCTTTCACCCTCA 

TCGTGGAGGCCTGGGACTGGGACAATGACACCACTCCAGATGAGGAGCTGCTGATTGAGCGGG 

TGTCGCACGCTGGCATGATCAACCCCGAGGACCGCTGGAAGAGCCTGCACTTCAGCGGCCACG 

TGGCACACCTGGAGCTGCAGATCCGAGTGCGCTGTGATGAGAACTACTACAGTGCCACCTGCA 

ACAAGTTCTGCCGGCCCCGCAACGACTTCTITGGCCACTATACCTGCGACCAGTACGGCAACAA 

GGCCTGCATGGATGGCTGGATGGGCAAAGAATGCAAAGAAGCCGTGTGTAAACAAGGATGTAA 

TTTGCTCCACGGGGGATGCACTGTGCCTGGGGAGTGCAGGTGCAGCTACGGCTGGCAGGGCAA 
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GTTCTGTGACGAGTGTGTCCCCTACCCTGGCTGCGTGCATGGCAGCTGTGTGGAGCCCTGGCAC 

TGTGACTGTGAGACCAACTGGGGTGGCCTGCTCTGCGACAAAGACCTGAACTACTGTGGCAGC 

CACCACCCCTGTGTCAACGGGGGTACCTGCATCAATGCTGAGCCTGACCAATACCTCT^ 

GCCCAGATGGCTACTTGGGCAAGAACTGTGAGCGGGCTGAGCACGCCTGTGCCTCCAACCCGT 

GTGCCAATGGGGGCTCTTGCCACGAAGTGCCATCTGGCTTrGAATGCCACTGTCCGTCAGGATG 

GAGCGGACCCACCTGTGCGCTCGACATTGATGAGTGTGCCTCTAACCCATGTGCAGCGGGTGGT 

ACCTGCGTGGATCAGGTGGACGGCTTCGAGTGCATCTGCCCGGAGCAGTGGGTGGGGGCTACT 

TGCCAGCTGGACGCCAATGAGTGTGAAGGGAAGCCGTGCCTTAATGCTTITrOTGCAAAAACC 

TGATTGGCGGCTATTACTGTGATTGCCTCCCGGGCTGGAAGGGCATCAACTGCCAAATCAACAT 

CAACGATTGTCATGGGCAGTGTCAGCATGGGGGCACCTGCAAGGACCTGGTCAATGGGTACCA 

GTGTGTGTGCCCGCGGGGCTTTGGAGGTCGCCATTGCGAACTAGAGTACGACAAGTGTGCCAG 

CAGCCCCTGCCGCCGGGGTGGCATCTGCGAGGACCTGGTGGATGGCTTCCGCTGCCACTGCCCA 

CGGGGCCTCTCTGGGCTGCACTGTGAGGTGGACATGGATCTCTGTGAACCAAGCCCCTGCCTCA 

ACGGTGCTCGCTGCTACAACCTTGAGGGTGACTACTACTGCGCCrGCCCAGAAGACTTTGGTGG 

CAAGAACTGCTCAGTGCCCAGGGACACATGCCCTGGCGGGGCATGTAGAGTGATCGATGGCTG 

CGGGTTCGAGGCAGGGTCCAGGGCACGCGGTGTCGCACCCTCTGGTATATGTGGCCCTCACGG 

GCACTGCGITAGCCTGCCTGGGGGAAACTTCTCCTGCATCTGTGACAGCGGCTTCACAGGCACC 

TACTGCCATGAAAACATTGACGACTGCATGGGCCAGCCCTGCCGCAACGGGGGCACGTGCATT 

GACGAAGTGGACTCCTTCCGCTGCTrCTGCCCCAGTGGCTGGGAAGGAGAACTCTGTGACATCA 

ATCCCAACGACTGCCTCCCCGACCCCTGCCACAGCCGCGGCCGCTGCTATGACCTGGTCAATGA 

CTTCTACTGTGCCTGTGACGATGGCTGGAAGGGCAAGACCTGCCACTCACGCGAGTTCCAGTGT 

GACGCCTACACCTGCAGCAACGGTGGCACATGCTATGACAGCGGCGACACCTTCCGCTGCGCG 

TGCCCTCCGGGCTGGAAGGGCAGCACCTGCACCATCGCCAAGAACAGCAGCTGTGTGCCCAAT 

CCCTGTGTGAATGGAGGCACCTGCGTGGGTAGCGGAGACTCTTTCTCCTGCATCTGCCGGGATG 

GCTGGGAGGGCCGCACCTGCACACATAACACCAATGACTGCAACCCTCTGCCCTGCTATAACG 

GAGGCATCTGTGTTGATGGCGTCAACTGGTTCCGCTGCGAGTGTGCGCCTGGCTTTGCGGGTCC 

TGACTGCCGTATCAACATTGATGAGTGCCAGTCCTCGCCCTGTGCCTACGGAGCCACGTGTGTG 

GATGAGATCAACGGGTACCGCTGCAGCTGCCCACCAGGTCGTTCTGGCCCCAGGTGCCAGGAA 

GTGGTCATATTCACGAGGCCCTGCTGGTCCCGGGGAATGTCCTTCCCGCATGGGAGTTCCTGGA 

TGGAAGACTGCAACAGCTGCCGCTGCCTGGATGGCCACCGGGATTGTAGCAAGGTATGGTGCG 

GATGGAAGCCTTGCCTGCTCTCTGGTCAGCCCAGCGATCCGAGTGCCCAGTGCCCCCCAGGGCA 

GCAATGTCAGGAGAAGGCCGTGGGTCAGTGCTTGCAGCCACCCTGTGAGAACTGGGGGGAGTG 

TACAGCGGAGGAGCCTCTGCCACCCAGCACCCCCTGTCAGCCACGGAGCAGTCATTTGGACAA 

CAACTGTGCCCGACTCACACTGCGCTTCAACCGTGATCAAGTGCCTCAGGGCACCACCGTGGGC 

GCTATCTGCTCTGGAATCCGAGCCTTGCCTGCCACGAGGGCGGCGGCACACGACCGCCTCCTCC 

TGCTGCnTGTGATCGAGCATCCTCGGGGGCCAGTGCTGTGGAGGTGGCTATGTCTTTCAGCCC 

TGCAAGGGACCTGCCTGACAGCAGCCTGATCCAGAGCACAGCCCACGCCATCGTGGCTGCTAT 

CACTCAGAGAGGAAATAGCTCACTGCTGCTGGCTGTCACCGAGGTCAAGGTGGAAACAGTTGT 

TATGGGTGGCTCTTCCACAGGTCTGTTGGTGCCCGTGCTGTGCAGCGTGTTCAGTGTGCTGTGGC 

TCGCCTGTGTGGTTATCTGCGTATGGTGGACACGAAAGCGCAGGAAAGAACGTGAGAGGAGCC 

GGCTACCACGGGATGAGAGCACCAACAACCAGTGGGCCCCGCTCAATCCCATCCGCAACCCCA 

TTGAGCGGCCAGGCGGCAGCGGTCTGGGAACTGGGGGCCACAAGGACATACTCTACCAGTGCA 

AAAACTTCACACCGCCGCCCCGCAGGGCAGGCGAGGCACTGCCCGGGCCAGCTGGCCATGGGG 

CTGGTGGGGAGGACGAGGAGGATGAAGAGCTGAGCCGTGGAGATGGGGACTCCCCAGAGGCA 

GAGAAGTTCATCTCACACAAGTTCACCAAAGACCCCAGCTGCTCCCTCGGAAGGCCAGCCTGCT 

GGGCTCCAGGGCCCAAAGTGGACAACCGCGCCGTCAGAAGTACCAAGGACGTGCGCCGTGCTG 

GCAGGGAGTAGCCAGCCACCAGGCTGGCACCCAGAACCCTTGCTGGCACCACGCTGCCTGCCG 

GACCATAGGAGGCCAAGGCCGTGTGCATAGTTTCTTTATTTTGTGTAAAAAACAAAACCAAAAC 

CAAAAAACAAATGTTTATTTTITACGTTTCnTrAACCTrGTATAA 

CGGAAAACAACGGAGTATTCTCGGATCATTGCTATIT1TGTAAAGTTTCCGCGTCCGCACGCAC 
TGTGGCAGGAGAGCAGGGCGTGTGTATGTGTGTGTGTGTGTGTCCTCACC 

Figure 8 

GAAGGCCATGGTCTCCCCACGGATGTCCGGGCTCCTCTCCCAGACTGTGATCCTAGCGCTCATTTrCCTC 

CCCCAGACACGGCCCGCTGGCGTCTTCGAGCTGCAGATCCACTCTTTCGGGCCGGGTCCAGGCCCTGGGG 

CCCCGCGGTCCCCCTGCAGCGCCCGGCTCCCCTGCCGCCTCTTCTTCAGAGTCTGCCTGAAGCCTGGGCT 
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CTCAGAGGAGGCCGCCGAGTCCCCGTGCGCCCTGGGCGCGGCGCTGAGTGCGCGCGGACCGGTCTACACC 

GAGCAGCCCGGAGCGCCCGCGCCTGATCTCCCACTGCCCGACGGGCTCTTGCAGGTGCCCTTCCGGGACG 

CCTGGCCTGGCACCTTCTCTTTCATCATCGAAACCTGGAGAGAGGAGTTAGGAGACCAGATTGGAGGGCC 

CGCCTGGAGCCTGCTGGCGCGCGTGGCTGGCAGGCGGCGCTTGGCAGCCGGAGGCCCGTGGGCCCGGGC 

ATTCAGCGCGCAGGCGCCTGGGAGCTGCGCTTCTCGTACCGCGCGCGCTGCGAGCCGCCTGCCGTCGGGA 

CCGCGTGCACGCGCCTCTGCCGTCCGCGCAGCGCCCCCTCGCGGTGCGGTCCGGGACTGCGCCCCTGCGC 

ACCGCTCGAGGACGAATGTGAGGCGCCGCTGGTGTGCCGAGCAGGCTGCAGCCCTGAGCATGGCTTCTGT 

GAACAGCCCGGTGAATGCCGATGCCTAGAGGGCTGGACTGGACCCCTCTGCACGGTCCCTGTCrCCACCA 

GCAGCTGCCTCAGCCCCAGGGGCCCGTCCTCTGCTACCACCGGATGCCTTGTCCCTGGGCCTGGGCCCTG 

TGACGGGAACCCGTGTGCCAATGGAGGCAGCTGTAGTGAGACACCCAGGTCCTTTGAATGCACCTGCCCG 

CGTGGGTTCTACGGGCTGCGGTGTGAGGTGAGCGGGGTGACATGTGCAGATGGACCCTGCTTCAACGGCG 

GCITGTGTGTCGGGGGTGCAGACCCTGACTCTGCCTACATCTGCCACTGCCCACCTGGTTTCCAAGGCTC 

CAACTGTGAGAAGAGGGTGGACCGGTGCAGCCTGCAGCCATGCCGCAATGGCGGACTCTGCCTGGACCG 

GGCCACGCCCTGCGCTGCCGCTGCCGCGCCGGCTTCGCGGGTCCTCGCTGCGAGCACGACCTGGACGACT 

GCGCGGGCCGCGCCTGCGCTAACGGCGGCACGTGTGTGGAGGGCGGCGGCGCGCACCGCTGCTCCTGCC 

GCTGGGCTTCGGCGGCCGCGACTGCCGCGAGCGCGCGGACCCGTGCGCCGCGCGCCCCTGTGCTCACGGC 

GGCCGCTGCTACGCCCACTTCTCCGGCCTCGTCTGCGCTTGCGCTCCCGGCTACATGGGAGCGCGGTGTG 

AGTTCCCAGTGCACCCCGACGGCGCAAGCGCCTTGCCCGCGGCCCCGCCGGGCCTCAGGCCCGGGGACCC 

TCAGCGCTACCTTTTGCCTCCGGCTCTGGGACTGCTCGTGGCCGCGGGCGTGGCCGGCGGTGCGCTCTTG 

CTGGTCCACGTGCGCCGCCGTGGCCACTCCCAGGATGCTGGGTCTCGCTTGCTGGCTGGGACCCCGGAGC 

CGTCAGTCCACGCACTCCCGGATGCACTCAACAACCTAAGGACGCAGGAGGGTTCCGGGGATGGTCCGG 

CTCGTCCGTAGATTGGAATCGCCCTGAAGATGTAGACCCTCAAGGGATTTATGTCATATCTGCTCCTTCC 

ATCTACGCTCGGGAGGTAGCGACGCCCCTnTCCCCCCGCTACACACTGGGCGCGCTGGGCAGAGGCAGC 

ACCTGCTTTITCCCTACCC1TCCTCGATTCTGTCCGTGAAATGAATTGGGTAGAGTCTCTGGAAGGTTTT 

AAGCCCATTTTCAGTTCTAACITACTTTC^ 
ATCTTCTCTTT 

Figure 9 

AAACCGGAACGGGGCCCAACTTCTGGGGCCTGGAGAAGGGAAACGAAGTCCCCCCCGGTTTCCCGAGGT 

GCCTTTCCTCGGGCATCCTTGGTTTCGGCGGGACTTCGCAGGGCGGATATAAAGAACGGCGCCTTTGGGA 

AGAGGCGGAGACCGGCTTTAAAGAAAGAAGTCTTGGTCCTGCGGCTTGGGCGAGGCAAGGGCGAGGCAG 

GGCGCTTTCTGCCGACGCTCCCCGTGGCCCTACGATCCCCCGCGCGTCCGCCGCTGTTCTAAGGAGAGAA 

GTGGGGGCCCCCCAGGCTCGCGCGTGGAGCGAAGCAGCATGGGCAGTCGGTGCGCGCTGGCCCTGGCGT 

GCTCTCGGCCTTGCTGTGTCAGGTCTGGAGCTCTGGGGTGTTCGAACTGAAGCTGCAGGAGTTCGTCAAC 

AAGAAGGGGCTGCTGGGGAACCGCAACTGCTGCCGCGGGGGCGCGGGGCCACCGCCGTGCGCCTGCCGA 

CCTTCTTCCGCGTGTGCCTCAAGCACTACCAGGCCAGCGTGTCCCCCGAGCCGCCCTGCACCTACGGCAG 

CGCCGTCACCCCCGTGCTGGGCGTCGACTCCTTCAGTCTGCCCGACGGCGGGGGCGCCGACTCCGCGTTC 

AGCAACCCCATCCGOTCCCCTTCGGCTrCACCTGGCCGGGCACCTTCTCTCTGATTATTGAAGCTCTCC 

ACACAGATTCTCCTGATGACCTCGCAACAGAAAACCCAGAAAGACTCATCAGCCGCCTGGCCACCCAGAG 

GCACCTGACGGTGGGCGAGGAGTGGTCCCAGGACCTGCACAGCAGCGGCCGCACGGACCTCAAGTACTC 

TACCGCTTCGTGTGTGACGAACACTACTACGGAGAGGGCTGCTCCGTTTTCTGCCGTCCCCGGGACGATG 

CCTTCGGCCACTTCACCTGTGGGGAGCGTGGGGAGAAAGTGTGCAACCCTGGCTGGAAAGGGCCCTACTG 

CACAGAGCCGATCTGCCTGCCTGGATGTGATGAGCAGCATGGATTTTGTGACAAACCAGGGGAATGCAAG 

TGCAGAGTGGGCTGGCAGGGCCGGTACTGTGACGAGTGTATCCGCTATCCAGGCTGTCTCCATGGCACCT 

GCCAGCAGCCCTGGCAGTGCAACTGCCAGGAAGGCTGGGGGGGCCTTITCTGCAACCAGGACCTGAACTA 

CTGCACACACCATAAGCCCTGCAAGAATGGAGCCACCTGCACCAACACGGGCCAGGGGAGCTACACTTC 

TCrTGCCGGCCTGGGTACACAGGTGCCACCTGCGAGCTGGGGATTGACGAGTGTGACCCCAGCCCTTGTA 

AGAACGGAGGGAGCTGCACGGATCTCGAGAACAGCTACTCCTGTACCTGCCCACCCGGCTTCTACGGCAA 

AATCTGTGAATTGAGTGCCATGACCTGTGCGGACGGCCCTTGCTTTAACGGGGGTCGGTGCTCAGACAGC 

CCCGATGGAGGGTACAGCTGCCGCTGCCCCGTGGGCTACTCCGGCTTCAACTGTGAGAAGAAAATTGACT 

ACTGCAGCTCITCACCCTGTTCTAATGGTGCCAAGTGTGTGGACCTCGGTGATGCCTACCTGTGCCGCTG 

CCAGGCCGGCTTCTCGGGGAGGCACTGTGACGACAACGTGGACGACTGCGCCTCCTCCCCGTGCGCCAAC 

GGGGGCACCTGCCGGGATGGCGTGAACGACTTCTCCTGCACCTGCCCGCCTGGCTACACGGGCAGGAACT 

GCAGTGCCCCCGTCAGCAGGTGCGAGCACGCACCCTGCCACAATGGGGCCACCTGCCACCAGAGGGGCA 

CGGCTATGTGTGCGAATGTGCCCGAAGCTACGGGGGTCCCAACTGCCAGTTCCTGCTCCCCGAGCTGCCC 

CCGGGCCCAGCGGTGGTGGACCTCACTGAGAAGCTAGAGGGCCAGGGCGGGCCATTCCCCTGGGTGGCG 

TGTGCGCCGGGGTCATCCTTGTCCTCATGCTGCTGCTGGGCTGTGCCGCTGTGGTGGTCTGCGTCCGGCT 

GAGGCTGCAGAAGCACCGGCCCCCAGCCGACCCCTGCCGGGGGGAGACGGAGACCATGAACAACCTGGC 

AACTGCCAGCGTGAGAAGGACATCTCAGTCAGCATCATCGGGGCCACGCAGATCAAGAACACCAACAAA 

AGGCGGACTTCCACGGGGACCACAGCGCCGACAAGAATGGCTTCAAGGCCCGCTACCCAGCGGTGGACA 

TAACCTCGTGCAGGACCTCAAGGGTGACGACACCGCCGTCAGGGACGCGCACAGCAAGCGTGACACCAG 



WO 03/012082 PCT/GB02/03409 

11/41 

TGCCAGCCCCAGGGCTCCTCAGGGGAGGAGAAGGGGACCCCGACCAC^ 

AAAGAAAAAGGCCGGACTCGGGCTGTTCAACTTCAAAAGACACCAAGTACCAGTCGGTGTACGTCAT^ 
CGAGGAGAAGGATGAGTGCGTCATAGCAACTGAGGTGTAAAATGGAAGTGAGATGGCAAGACTCCCGTT 
CTCITAAAATAAGTAAAATTCCAAGGATATATGCCCCAACGAATGCTGCTGAAGAGGAGGGAGG 
GGACTGCTGCTGAGAAACCGAGTTCAGACCGAGCAGGTTCTCCTCCTGAGGTCCTCGACGCCTGCCGACA 

GCCTGTCGCGGCCCGGCCGCCTGCGGCACTGCCTTCCXjTGACGTCGCCGTTGCACTATGG 
TrAAGAGAATATATATTTAAATGGGTGAACTGAATTACGCCTAAGAAGCATGCACTGCCTGAGTGTATAT 

TTTGGATTCTTATGAGCCAGTCTTTTCTTGAATTAGAAACACAAAC^ 

ACGAAGATGTGCTTTTTCTAGATGGAAAAGATGTGTGTTATTTTTTGGA 

ATATCTGTAAAGCTTGAGTATTTTGTGATGTTCGTIT^ 

GGCACTTCGGGTCTATGTGACTATATTTTTTTGTATATAAATGTATTTATGGAATATTGTGCCAATGTTA 

TTTGAGTTTTTTACTGTTTTGTTAATGM 

AGGAATTC 

Figure 10 

ATGGCGGCAGCGTCCCGGAGCGCCTCTGGCTGGGCGCTACTGCTGCTGGTGGCACTTTGGCAGCAGCGCG 

CGGCCGGCTCCGGCGTCTTCCAGCTGCAGCTGCAGGAGTTCATCAACGAGCGCGGCGTACTGGCCAGTGG 

GCGGCCTTGCGAGCCCGGCTGCCGGACTrTCTTCCGCGTCTGCCTTAAGCACTTCCAGGCGGTCGTCTCG 

CCCGGACCCTGCACCTTCGGGACCGTCTCCACGCCGGTATTGGGCACCAACTCCTTCGCTGTCCGGGACG 

ACAGTAGCGGCGGGGGGCGCAACCCTCTCCAACTGCCCTTCAATTTCACCTGGCCGGGTACCTTCTCGCT 

CATCATCGAAGCTTGGCACGCGCCAGGAGACGACCTGCGGCCAGAGGCCTTGCCACCAGATGCACTCATC 

AGCAAGATCGCCATCCAGGGCTCCCTAGCTGTGGGTCAGAACTGGTTATTGGATGAGCAAACCAGCACCC 

TCACAAGGCTGCGCTACTCTTACCGGGTCATCTGCAGTGACAACTACTATGGAGACAACTGCTCCCGCCT 

GTGCAAGAAGCGCAATGACCACTTCGGCCACTATGTGTGCCAGCCAGATGGCAACTTGTCCTGCCTGCCC 

ggttggactggggaatattgccaacagcctatctgtctttcgggctgtcatgaacagaatggctactgca 

gcaagccag'cagagtgcctctgccgcccaggctggcagggccggctgtgtaacgaatgcatcccccacaa 

tggctgtcgccacggcacctgcagcactccctggcaatgtacttgtgatgagggctggggaggcctgttt 

TGTGACCAAGATCTCAACTACTGCACCCACCACTCCCCATGCAAGAATGGGGCAACGTGCTCCAACAGTG 

GGCAGCGAAGCTACACCTGCACCTGTCGCCCAGGCTACACTGGTGTGGACTGTGAGCTGGAGCTCAGCGA 

GTGTGACAGCAACCCCTGTCGCAATGGAGGCAGCTGTAAGGACCAGGAGGATGGCTACCACTGCCTGTGT 

CCTCCGGGCTACTATGGCCTGCATTGTGAACACAGCACCTTGAGCTGCGCCGACTCCCCCTGCTTCAATG 

GGGGCTCCTGCCGGGAGCGCAACCAGGGGGCCAACTATGCTTGTGAATGTCCCCCCAACTTCACCGGCTC 

CAACTGCGAGAAGAAAGTGGACAGGTGCACCAGCAACCCCTGTGCCAACGGGGGACAGTGCCTGAACCA 

GGTCCAAGCCGCATGTGCCGCTGCCGTCCTGGATTCACGGGCACCTACTGTGAACTCCACGTCAGCGACT 

GTGCCCGTAACCCTTGCGCCCACGGTGGCACTTGCCATGACCTGGAGAATGGGCTCATGTGCACCTGCCC 

TGCCGGCTTCTCTGGCCGACGCTGTGAGGTGCGGACATCCATCGATGCCTGTGCCTCGAGTCCCTGCTTC 

AACAGGGCCACCTGCTACACCGACCTCTCCACAGACACCTTrGTGTGCAACTGCCCTTATGGCTTTGTGG 

GCAGCCGCTGCGAGTTCCCCGTGGGCTTGCCGCCCAGCTTCCCCTGGGTGGCCGTCTCGCTGGGTGTGGG 

GCTGGCAGTGCTGCTGGTACTGCTGGGCATGGTGGCAGTGGCTGTGCGGCAGCTGCGGCTTCGACGGCCG 

GACGACGGCAGCAGGGAAGCCATGAACAACTTGTCGGACTTCCAGAAGGACAACCTGATTCCTGCCGCC 

AGCTTAAAAACACAAACCAGAAGAAGGAGCTGGAAGTGGACTGTGGCCTGGACAAGTCCAACTGTGGCA 

ACAGCAAAACCACACATTGGACTATAATCTGGCCCCAGGGCCCCTGGGGCGGGGGACCATGCCAGGAAG 

TTTCCCCACAGTGACAAGAGCTTAGGAGAGAAGGCGCCACTGCGGTTACACAGTGAAAAGCCAGAGTGC 

GGATATCAGCGATATGCTCCCCCAGGGACTCCATGTACCAGTCTGTGTGTTTGATATCAGAGGAGAGGAA 

TGAATGTGTCATTGCCACGGAGGTATAA 

Figure 11 

CTCGCAGGCTAGGAACCCGAGGCCAAGAGCTGCAGCCAAAGTCACTTGGGTGCAGTGTACTCCCTCACTA 

GCCCGCTCGAGACCCTAGGATTTGCTCCAGGACACGTACTTAGAGCAGCCACCGCCCAGTCGCCCTCACC 

TGGATTACCTACCGAGGCATCGAGCAGCGGAGTTTTTGAGAAGGCGACAAGGGAGCAGCGTCCCGAGGG 

AATCAGCTTTrCAGGAACTCGGCTGGCAGACGGGACTTGCGGGAGAGCGACATCCCTAACAAGCAGATTC 

GGAGTCCCGGAGTGGAGAGGACACCCCAAGGGATGACGCCTGCGTCCCGGAGCGCCTGTCGCTGGGCGT 

ACTGCTGCTGGCGGTACTGTGGCCGCAGCAGCGCGCTGCGGGCTCCGGCATCTTCCAGCTGCGGCTGCAG 

GAGTTCGTCAACCAGCGCGGTATGCTGGCCAATGGGCAGTCCTGCGAACCGGGCTGCCGGACTTTCTTCC 

GCATTTGCCTrAAGCACTTCCAGGCAACCTTCTCCGAGGGACCCTGCACCrTTGGCAATGTCTCCACGCC 

GGTATTGGGCACCAACTCCTTCGTCGTCAGGGACAAGAATAGCGGCAGTGGTCGCAACCCTCTGCAGTTG 

CCCTTCAATTTCACCTGGCCGGGAACCTTCTCACTCAACATCCAAGCTTGGCACACACCGGGAGACGACC 

TGCGGCCAGAGACTTCGCCAGGAAACTCTCTCATCAGCCAAATCATCATCCAAGGCTCTCTTGCTGTGGG 
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aISwcago^ 

CAGGGTCGCCTGTGCAATGAATGTATCCCCCACAATGGCTGTCGTCATO 

AGTGTGCCTGCGATGAGGGATGGGGAGGTCTGTTTTGTGACCAAGATCTCAACTACTGTACrrcACCA 

TACACTGGTGAGCACTGTGAGCTGGGACTCAGCAAGTGTGCCAGCAACCCCTGTCGAAATGGTGGCAGCT 

TArfTTGACCTGTGCGGACTCACCCTGCTTCAATGGGGGCTCTTGCCGGGAGCGCAACCAGGGGTCCAGT 

^ACAGGCACCCACTGTG^ 

CACGATCTGGAGAATGGGCCTGTGTGCACCTGCCCCGCTGGCTTCT 
TAACCCACGATGCCTGTGCCTCCGGACCCTGOT 

JIIotcgtctgc^ 

AGCTTCCCCTCGGTAGCTGTCTCGCTG^jGCGTGGGGCT 

agaotccagaaggacaacctaatccctgccgcccagctcaaaaacac^ 
cgggactcctaggacggggcagcatgcct^^^ 

GCCACTTCGGTTACACAGTGAGAAGCCAGAGTGTCXjAATATCAGCCAT^ 

taccaatcagtgtgtttgatatcagaagagaggaacgagtgtgtgat^ 

gcctactcagacacccagctccggcccagcagctgggcct^ 
atgggacatctttagtatgcacagtgctgctctgcggaggaggagggaatggcatgaact 

tcScccgcca^aotgca^^ 
ScISgggaa^gag^gaggagttagagg 
ggaotg^gc^agtggtcatcatgg^ 
tgggtcttcggcgctgct 

tcacagctgggcccaaatgagaaagcagagaggaggotgcaaaggatag 

COTGGAG^GCATTAAGCAGGAGCTACTCTGCAGGTGAGGAAAGCCCGAGGAGGGGACA^ 

TCCTGCCTCCAACCCCAGCAGGTGGGGTGCCACCTGCAGCCTCTAGGCAAGAGTO^ 

CCTGGTGCCTCTGGGCTCATGTGAACAGATGGGCTrAGGGCACGC^ 

CCTCACTGGGGAGCTCAGGGCCTTCATGCTAAACTCCCAATAAGGGAGATGGGGGGAAGGGG^GCJTCTGC 

CTAGGCCCTrCCCTCCCTCACACCCATTTTTGGGCCCTTGAGCCTGGGCTCC 

S^AGACCAACOT^^ 

AATCTAGTATTTTGATAATTTAAGAATCAGAAGCACTGGCCTTTCTACATTTTATAACAT^ 
ATAATGTGTATTTATAATATGAAACAGATGTGTACATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Figure 12 

AAACCCACTCCACCTTACTACCAGACAACCTTAGCCAAACCATTTACCCAAATAAAGmTA 

GATAGAAATTGAAACCTGGCGCAATAGATATAGTACCGCAAGGGAAAGATGAAAAATTATAAC 

CAAGCATAATATAGCAAGGACTAACCCCTATACCTTCTGCATAATGAATTAACTAGAAATAACT 

TTGCAAGGAGAGTCAAAGCTAAGGCCCCCGAAACCAGGCGAGCTACCTAAGAACAGCTAA^ 

GAGCACACCCGTCTATGTAGCAAAATAGTGGGAAGATTTATAGGTAGAGGCGACAAACCTACC 

GAGCCTGGTGATAGCTGGTTGTCCAAGATAGAATCTTAGTTCAACTTTAAATTTGCCCACAGAA 

CCCTCTAAATCCCCTTGTAAATTrAACTGTrAGTCCAAAGAGGAACAGCTCTTTGGACACTAGG 

AAAAAACCTTGTAGAGAGAGTGTGAGCCCAATTCCAC^ 

AGTGGTAGCCATAAGCATTTTTGGAATTCAACTAAAAACTGAAGGATCCTTGAGGACG 

ACCTGGCATACCTACACAGTCAGCGTTCAACAAGTGTTTGCAAAGGTACATTGGGGCACTG^G^G 

GGCACGAGTGATCTGTGACAATATCCCTGGTTTGGTGAGCCGGCAGCGGCAGCTGTGCCAGCGT 

TACCCAGACATCATGCGTTCAGTGGGCGAGGGTGCCCGAGAATGGATCCGAGAGTGTCAGCAC 

CAATTCCGCCACCACCGCTGGAACTGTACCACCCTGGACCGGGACCACACCGTCTTTGGCCGTG 

TCATGCTCAGAAGTAGCCGAGAGGCAGCTTTTGTATATGCCATCTCATCAGCAGGGGTGATCC^ 

CGCTATTACTCGCGCCTGTAGCCAGGGTGAACTGAGTGTGTGCAGCTGTGACCCCTACACCCGT 

GGCCGACACCATGACCAGCGTGGGACITITGACTGGGGTGGCTGCAGTGACAACATCCACTAC 

GGTGTCCGTTTTGCCAAGGCCTTCGTGGATGCCAAGGAGAAGAGGCTTAAGGATGCCCGGGCC 

CTCATGAACTTACATAATAACCGCTGTGGTCGCACGGCTGTGCGGCGGTTTGTCAAGCTGGAGT 
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GTAAGTGCCATGGCGTGAGTGGTTCCTGTACTCTGCGCACCTGCTGGCGTGCACTCTCAGATTT 

CCGCCGCACAGGTGATTACCTGCGGCGACGCTATGATGGGGCTGTGCAGGTGATGGCCACCCA 

AGATGGTGCCAACTTCACCGCAGCCCGCCAAGGCTATCGCCGTGCCACCCGGAGTGATCTTGTC 

TACTTTGACAACTCTCCAGATTACTGTGTCTTGGACAAGGCTGCAGGTTCCCTAGGCACTGCAG 

GCCGTGTCTGCAGCAAGACATCAAAAGGAACAGACGGTTGTGAAATCATGTGCTGTGGCCGAG 

GGTACGACACAACTCGAGTCACCCGTGTTACCCAGTGTGAGTGCAAATTCCACTGGTGCTGTGC 

TGTACGGTGCAAGGAATGCAGAAATACTGTGGACGTCCATACTTGCAAAGCCCCCAAGAAGGC 

AGAGTGGCTGGACCAGACCTGAACACACAGATACCTCACTCATCCCTCCAATTCAAGCCTCTCA 

ACTCAAAAGCACAAGATCCTTGCATGCACACCTTCCTCCACCCTCCACCCTGGGCTGCTACCGC 

TTCTATTTAAGGATGTAGAGAGTAATCCATAGGGACCATGGTGTCCTGGCTGGTTCCTTAGCCC 

TGGGAAGGAGTTGTCAGGGGATATAAGAAACTGTGCAAGCTCCCTGATTTCCCGCTCTGGAGAT 

1TGAAGGGAGAGTAGAAGAGATAGGGGGTCTTTAGAGTGAAATGAGTTGCACTAAAGTACGTA 

GTTGAGGCTCCTTTTTTCTTTCCTTTGCACCAGCCT^ 
GGTACCTGTAGAGAGCTTCITITTGTTTCTACCTGGCCA^ 

GCATGTCCCITCTCTGAAGTCCGTTTGAGCAGAACrACCTGGTACCCCGAAAGAAAAATCTTAG 
GCTACCACATTCTATTATTGAGAGCCTGAGATGTTAGCCATAGTGGACAAGGTTCCATTCACAT 
GCTCATATGTTTATAAACTGTGTTTTGTAGAAGAAAAAGAATCATAACAATACAAACACACATT 
CATTCTCTCTTTTrCTCTCTACCATTCTCAACCTGTATTGG 

GCTGCCTGTTCAAACTGAGGTGGAATGCAGTGGTTCCCATGCTTAACAGATCATTAAAACACCC 
TAGAACACTCCTAGGATAGATTAATGT 

Figure 13 



ACCGCAGGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGCACGGTTTCGTGGGGACCCAG 

GCTTGCAAAGTGACGGTCATITrCTCTTrCTTTCTCCCTCTTGAGTCCTTCTGAGATGATGGCT 

GGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGGTAGCGGCGGCTCTCGGCGGCCACCC 

TCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTCAATTCCAACGCTATCAAGAACCTGCCC 

CCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTCTGCAGTCAGCGCCGCGCCGGGAATCCTG 

TACCCGGGCGGGAATAAGTACCAGACCATTGACAACTACCAGCCGTACCCGTGCGCAGAGGAC 

GAGGAGTGCGGCACTGATGAGTACTGCGCTAGTCCCACCCGCGGAGGGGACGCAGGCGTGCAA 

ATCTGTCTCGCCTGCAGGAAGCGCCGAAAACGCTGCATGCGTCACGCTATGTGCTGCCCCGGGA 

ATTACTGCAAAAATGGAATATGTGTGTCTTCTGATCAAAATCATTTCCGAGGAGAAATTGAGGA 

AACCATCACTGAAAGCTTTGGTAATGATCATAGCACCTTGGATGGGTATTCCAGAAGAACCACC 

TrGTCTTCAAAAATGTATCACACCAAAGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACT 

GTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGTCCAAGATCTGTAAACCTGTCCTGAAAGA 

AGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCTCTCATGGACTAGAAATATTCCAGCGTTG 

TTACTGTGGAGAAGGTCTGTCrTGCCGGATACAGAAAGATCACCATCAAGCCAGTAATTCTTCT 

AGGCTrCACACTTGTCAGAGACACTAAACCAGCTATCCAAATGCAGTGAACTCCTTTTATATAA 

TAGATGCTATGAAAACCTTTTATGACCTrCATCAACTCAATCCTAAGGATATACAAGTTCTGTG 

GTTTCAGTTAAGCATTCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAGCTTTGTTTCTTTAT 

GGAACTCCCCTGTGATTGCAGTAAATTACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAA 

ATGCAATGAAACTTTTAATTATTTITCTAAAGGTGCTGCACTGCCTATTTTrCCr 

AATTTTTGTACACATTGATTGTTATCTrGACTGACAAATATTCTATATTGAACT 

TTTCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCAA 

GGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAAAAT ACTAG CTrATTTTC 

TGAAATGTACTATCTTAATGCTTAAATTATAm 

ATTTAACATTTAATATCATGAAATGTTATAA 

Figure 14 



AGAAAGCGGGAGCCCGCGGCGAGCGTAGCGCAAGTCCGCTCCCTAGGCATCGCTGCGCTGGCA 
GCGATTCGCTGTCTCTTGTGAGTCAGGGGACAACGCTTCGGGGCAACTGTGAGTGCGCGTGTGG 
GGGACCTCGATTCTCTTCAGATCTCGAGGATTCGGTCCGGGGACGTCTCCTGATCCCCTACTAA 
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AGCGCCTGCTAACnTGAAAAGGAGCACTGTGTCCTGCAAAGTTTGACACATAAAGGATAGGA 

AAAGAGAGGAGAGAAAAGCAACTGAGTTGAAGGAGAAGGAGCTGATGCGGGCCTCCTGATCA 

ATTAAGAGGAGAGTTAAACCGCCGAGATCCCGGCGGGACCAAGGAGGTGCGGGGCAAGAAGG 

AACGGAAGCGGTGCGATCCACAGGGCTGGGTTTTCTTGCACCTTGGGTCACGCCTCCTTGGCGA 

GAAAGCGCCTCGCATTTGATTGCTTCCAGTTATTGCAGAACTTCCTGTCCTGGTGGAGAAGCGG 

GTCrCGCTTGGGTTCCGCTAATTTCTGTCCTGAGGCGTGAGACTGAGTTCATAGGGTCCTGGGTC 

CCCGAACCAGGAAGGGTTGAGGGAACACAATCTGCAAGCCCCCGCGACCCAAGTGAGGGGCCC 

CGTGTTGGGGTCCTCCCTCCCTTTGCATTCCCACCCCTCCGGGCTTTGCGTCTTCCTGGGGACCC 

CCTCGCCGGGAGATGGCCGCGTTGATGCGGAGCAAGGATTCGTCCTGCTGCCTGCTCCTACTGG 

CCGCGGTGCTGATGGTGGAGAGCTCACAGATCGGCAGTTCGCGGGCCAAACTCAACTCCATCA 

AGTCCTCTCTGGGCGGGGAGACGCCTGGTCAGGCCGCCAATCGATCTGCGGGCATGTACCAAG 

GACTGGCATTCGGCGGCAGTAAGAAGGGCAAAAACCTGGGGCAGGCCTACCCTTGTAGCAGTG 

ATAAGGAGTGTGAAGTTGGGAGGTATTGCCACAGTCCCCACCAAGGATCATCGGCCTGCATGG 

TGTGTCGGAGAAAAAAGAAGCGCTGCCACCGAGATGGCATGTGCTGCCCCAGTACCCGCTGCA 

ATAATGGCATCTGTATCCCAGTTACTGAAAGCATCTTAACCCCTCACATCCCGGCTCTGGATGG 

TACTCGGCACAGAGATCGAAACCACGGTCATTACTCAAACCATGACTTGGGATGGCAGAATCT 

AGGAAGACCACACACTAAGATGTCACATATAAAAGGGCATGAAGGAGACCCCTGCCTACGATC 

ATCAGACTGCATTGAAGGGTTTrGCTGTGCTCGTCATTTCTGGACCAAAATCTGCAAACCAGTG 

CTCCATCAGGGGGAAGTCTGTACCAAACAACGCAAGAAGGGTTCTCATGGGCTGGAAATTTTC 

CAGCGTTGCGACTGTGCGAAGGGCCTGTCTTGCAAAGTATGGAAAGATGCCACCTACTCCTCCA 

AAGCCAGACTCCATGTGTGTCAGAAAATTTGATCACCATTGAGGAACATCATCAATTGCAGACT 

GTGAAGTTGTGTATTTAATGCATTATAGCATGGTGGAAAATAAGGTTCAGATGCAGAAGAATG 

GCTAAAATAAGAAACGTGATAAGAATATAGATGATCACAAAAAGGGAGAAAGAAAACATGAA 

CTGAATAGATTAGAATGGGTGACAAATGCAGTGCAGCCAGTGTTTCCATTATGCAACTTGTCTA 

TGTAAATAATGTACACATTTGTGGAAAATGCTATTATTAAGAGAACAAGCACACAGTGGAAAT 

TACTGATGAGTAGCATGTGACITrCCAAGAGTTTAGGTTGTGCTGGAGGAGAGGTTTCCTTCAG 

ATTGCTGATTGCTTATACAAATAACCTACATGCCAGATTTCTATTCAACGTTAGAGTTTAACAA 

AATACTCCTAGAATAACTTGTTATACAATAGGTrCTAAAAATAAAATTGCTAAACAAGAAATGA 

AAACATGGAGCATTGTTAATTrACAACAGAAAATTACCTTTTGATTTGTAACACTACTTCTGCTG 

TTCAATCAAGAGTCITGGTAGATAAGAAAAAAATCAGTCAATATTTCCAAATAATTGCAAAATA 

ATGGCCAGTTGTTTAGGAAGGCCTTTAGGAAGACAAATAAATAACAAACAAACAGCCACAAAT 

AClllirilTCAAAATTTTAGTTTTACCTGTAATTAATAAGAACTGATACAAGACAAAAACAG 

ccttcagattctacggaatgacagtatatctctctitatcctAtgtgattcctgctctgaatgca 

ttatattttccaaactatacccataaattgtgactagtaaaatacttacacagagcagaatttt 

cacagatggcaaaaaaatttaaagatgtccaatatatgtgggaaaagagctaacagagagatc 

attatttcttaaagattggccataaccrgtattttgatagaattagattggtaaatacatgtatr 

catacatactctgtggtaatagagacttgagctggatctgtactgcactggagtaagcaagaa 

aattgggaaaactttttcgtttgttcaggtitrggcaacacatagatcatatgtctgaggcaca 

agttggctgttcatattgaaaccaggggatgcacagtctaaatgaatatctgcatgggatttg 

ctatcataatatttactatgcagatgaattcagtgtgaggtcctgtgtccgtactatcctcaaat 

tatttattttatagtgctgagatcctcaaataatct^ 

cctgaagtagacagagtagtgaggtttcattgccctctataagcttctgactagccaatggcat 

catccaatittctrcccaaaccrctgcagcatctgcttrattgccaaagggctagtitcggtt^ 

ctgcagccattgcggttaaaaaatataagtaggataacttgtaaaacctgcatattgctaatct 

atagacaccacagtttctaaattctttgaaaccactt^ 

ctaaatactttgtctggagcacaaaacaataaaaggttat^ 

tgtagaccacaattcacttitragtttrcttttactraaatcccatct 

tctcccagtagagattgagtttgagcctgtatatctattaaaaatitcaacttcccacatatatt 

tactaagatgattaagacttacattttctgcacaggtctgcaaaaacaaaaattataaactagt 

ccatccaagaaccaaagtttgtataaacaggttgctataagcttggtgaaatgaaaatggaac 

atttcaatcaaacatttcctatataacaattattatatttacaatttggtttct 

ttatgtccacccttttaaaaattattatttgaagtaatttatttacaggaaatgttaatgagat^ 

TATTTTCTTATAGAGATATTTCTTACAGAAAGCTTTGTAGCAGAATATATTTGCAGCTATO 

TTGTAATTTAGGAAAAATGTATAATAAGATAAAATCTATTAAATTTTTCTCCTCT 

ATTCAAAGC 
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ACACACAGGCGGCGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTG 

CCTGCTGCTGGCGGCGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCA 

GTCAAGCCCGGCCCGGCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGC 

GAGGTTGAGGAACTGATGGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGA 

GGCAGAAGAAGCTGCTGCTAAAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTA 

TCACAATGAGACCAACACAGACACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAAT 

TCACAAGATAACCAACAACCAGACTGGACAAATGGTCTTTTCAGAGACAGTTATCACATCTGTG 

GGAGACGAAGAAGGCAGAAGGAGCCACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCAT 

GTACTGCCAGTTTGCCAGCTTCCAGTACACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGC 

ACCCGGGACAGTGAGTGCTGTGGAGACCAGCTGTGTGTCTGGGGTCACTGCACCAAAATGGCC 

ACCAGGGGCAGCAATGGGACCATCTGTGACAACCAGAGGGACTGCCAGCCGGGGCTGTGCTGT 

GCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGCACACCCCTGCCCGTGGAGGGCGAGCTTTGCC 

ATGACCCCGCCAGCCGGCTTCTGGACCTCATCACCTGGGAGCTAGAGCCTGATGGAGCCTTGGA 

CCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGCCCCACAGCCACAGCCTGGTGTATGTGTGC 

AAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGGGAGATCCTGCTGCCCAGAGAGGTCCCC 

GATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCGCCAGGAGCTGGAGGACCTGGAGAGG 

AGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTGCCGCCGCTGCACTGCTGGGAGGG 

GAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAATAGAAATAGCTAATTTATTTCC 

CCAGGTGTGTGCTITAGGCGTGGGCTGACCAGGCTTCTrCCTACATCTrCTTCCCAGTAAGTTrC 

CCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCTCCCCCAGGCTGTTCTCCAGGCT 

TCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCAGGAGCAGTTTGCCACCCCTGT 

CCAGATTATTGGCTGCTTTGCCrCTACCAGTTGGCAGACAGCCGTTTGTTCTACATGGCTTTGAT 

AATTGTITGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTCTGATTGGTTTrGGGGAAATG 

TGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATGCAACAAATGAATTTTCCA 

CGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGCAGATGAAATGTTCTGTTC 

ACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCCTACCTCTGTGCCAGGGCA 

GCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGGAGGGGGTCATTGTTCTCC 

TCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCCCAAGTCACACAGCTAGT 

GAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCTCTCCAC TACCC CACAC 

CAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGATTTTTCTTTTGAGGCA 

TGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAACTACTGTTAGGA 

ACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGATATTGACACTGT 

CCCTCTTTGGCAGTTGCATrAGTAACTTTGAAAGGTATATGACTGAGCGTAGCATACAGGTTAA 

CCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGCAAAATCAC 

TTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGCTGTGTG 

AAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATGTTTTC 

AGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCITAAAGTTTAAAGTTGCACATG 

ATTGTATAAGCATGCTTTCTTTGAGTTTTAA^ 

AGCATAAATCACTTCAACTGCTCTTCT 



Figure 16 



GACAAACAGACGACGTGCTGAGCTGCCAGCTTAGTGGAAGCTCTGCTCTGGGTGGAGAGCAGC 

CTCGCTTTGGTGACGCACAGTGCTGGGACCCTCCAGGAGCCCCGGGATTGAAGGATGGTGGCG 

GCCGTCCTGCTGGGGCTGAGCTGGCTCTGCTCTCCCCTGGGAGCTCTGGTCCTGGACTTCAACA 

ACATCAGGAGCTCTGCTGACCTGCATGGGGCCCGGAAGGGCTCACAGTGCCTGTCTGACACGG 

ACTGCAATACCAGAAAGTTCTGCCTCCAGCCCCGCGATGAGAAGCCGTTCTGTGCTACATGTCG 

TGGGTTGCGGAGGAGGTGCCAGCGAGATGCCATGTGCTGCCCTGGGACACTCTGTGTGAACGA 

TGTTTGTACTACGATGGAAGATGCAACCCCAATATTAGAAAGGCAGCTTGATGAGCAAGATGG 

CACACATGCAGAAGGAACAACTGGGCACCCAGTCCAGGAAAACCAACCCAAAAGGAAGCCAA 

GTATTAAGAAATCACAAGGCAGGAAGGGACAAGAGGGAGAAAGTTGTCTGAGAACTTTTGACT 

GTGGCCCTGGACTTTGCTGTGCTCGTCATITTTGGACGAAA^ 

GGACAGGTCTGCTCCAGAAGAGGGCATAAAGACACTGCTCAAGCTCCAGAAATCTTCCAGCGT 
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TGCGACTGTGGCCCTGGACTACTGTGTCGAAGCCAATTGACCAGCAATCGGCAGCATGCTOTAT 
TAAGAGTATGCCAAAAAATAGAAAAGCTATAAATATTTCAAAATAAAGAAGAATCCACATTGC 

ATTTGAG 
Figure 17 

ATGGGGCTCTGGGCGCTGTTGCCTGGCTGGGTTTCTGCTACGCTGCTGCTGGCGCTGGCCGCT 

GCCCGCAGCCCTGGCTGCCAACAGCAGTGGCCGATGGTGGGGTATTGTGAACGTAGCCTCCTCC 

ACGAACCTGCrTACAGACTCCAAGAGTCTGCAACTGGTACTCGAGCCCAGTCTGCAGCTGTTGA 

GCCGCAAACAGCGGCGCCTGATACGCCAAAATCCGGGGATCCTGCACAGCGTGAGTGGGGGGC 

TGCAGAGTGCCGTGCGCGAGTGCAAGTGGCAGTTCCGGAATCGCCGCTGGAACTGTCCCACTG 

CTCCAGGGCCCCACCTCTTCGGCAAGATCGTCAACCGAGGCTGTCGAGAAACGGCGTTTATCTT 

CGCTATCACCTCCGCCGGGGTCACCCATTCGGTGGCGCGCTCCTGCTCAGAAGGTTCCATCGAA 

TCCTGCACGTGTGACTACCGGCGGCGCGGCCCCGGGGGCCCCGACTGGCACTGGGGGGGCTGC 

AGCGACAACATTGACTTCGGCCGCCTCTTCGGCCGGGAGTTCGTGGACTCCGGGGAGAAGGGG 

CGGGACCTGCGCTrCCTCATGAACCTTCACAACAACGAGGCAGGCCGTACGACCGTATTCTCCG 

AGATGCGCCAGGAGTGCAAGTGCCACGGGATGTCCGGCTCATGCACGGTGCGCACGTGCTGGA 

TGCGGCTGCCCACGCTGCGCGCCGTGGGCGATGTGCTGCGCGACCGCTTCGACGGCGCCTCGCG 

CGTCCTGTACGGCAACCGCGGCAGCAACCGCGCTTCGCGAGCGGAGCTGCTGCGCCTGGAGCC 

GGAAGACCCGGCCCACAAACCGCCCTCCCCCCACGACCTCGTCTACTTCGAGAAATCGCCCAAC 

TTCTGCACGTACAGCGGACGCCTGGGCACAGCAGGCACGGCAGGGCGCGCCTGTAACAGCTCG 

TCGCCCGCGCTGGACGGCTGCGAGCTGCTCTGCTGCGGCAGGGGCCACCGCACGCGCACGCAG 

CGCGTCACCGAGCGCTGCAACTGCACCTTCCACTGGTGCTGCCACGTCAGCTGCCGCAACTGCA 

CGCACACGCGCGTACTGCACGAGTGTCTGTGA 



Figure 18 

AGCAGAGCGGACGGGCGCGCGGGAGGCGCGCAGAGCTTTCGGGCTGCAGGCGCTCGCTGCCGC 

TGGGGAATTGGGCTGTGGGCGAGGCGGTCCGGGCTGGCCTTTATCGCTCGCTGGGCCCATCGTT 

TGAAACTTTATCAGCGAGTCGCCACTCGTCGCAGGACCGAGCGGGGGGCGGGGGCGCGGCGAG 

GCGGCGGCCGTGACGAGGCGCTCCCGGAGCTGAGCGCTTCTGCTCTGGGCACGCATGGCGCCC 

GCACACGGAGTCTGACCTGATGCAGACGCAAGGGGGTTAATATGAACGCCCCTCTCGGTGGAA 

TCTGGCTCTGGCTCCCTCTGCTCTTGACCTGGCTCACCCCCGAGGTCAACTCTTCATGGTGGTAC 

ATGAGAGCTACAGGTGGCTCCTCCAGGGTGATGTGCGATAATGTGCCAGGCCTGGTGAGCAGC 

CAGCGGCAGCTGTGTCACCGACATCCAGATGTGATGCGTGCCATTAGCCAGGGCGTGGCCGAG 

TGGACAGCAGAATGCCAGCACCAGTTCCGCCAGCACCGCTGGAATTGCAACACCCTGGACAGG 

GATCACAGCCTTTTTGGCAGGGTCCTACTCCGAAGTAGTCGGGAATCTGCCTTTGTTTATGCCAT 

CTCCTCAGCTGGAGTTGTATTTGCCATCACCAGGGCCTGTAGCCAAGGAGAAGTAAAATCCTGT 

TCCTGTGATCCAAAGAAGATGGGAAGCGCCAAGGACAGCAAAGGCATTTTTGATTGGGGTGGC 

TGCAGTGATAACATTGACTATGGGATCAAATTTGCCCGCGCATTTGTGGATGCAAAGGAAAGG 

AAAGGAAAGGATGCCAGAGCCCTGATGAATCTTCACAACAACAGAGCTGGCAGGAAGGCTGTA 

AAGCGGTTCTTGAAACAAGAGTGCAAGTGCCACGGGGTGAGCGGCTCATGTACTCTCAGGACA 

TGCTGGCTGGCCATGGCCGACTTCAGGAAAACGGGCGATTATCTCTGGAGGAAGTACAATGGG 

GCCATCCAGGTGGTCATGAACCAGGATGGCACAGGTTTCACTGTGGCTAACGAGAGGTTTAAG 

AAGCCAACGAAAAATGACCTCGTGTATTTTGAGAATTCTCCAGACTACTGTATCAGGGACCGAG 

AGGCAGGCTCCCTGGGTACAGCAGGCCGTGTGTGCAACCTGACTTCCCGGGGCATGGACAGCT 

GTGAAGTCATGTGCTGTGGGAGAGGCTACGACACCTCCCATGTCACCCGGATGACCAAGTGTG 

GGTGTAAGTTCCACTGGTGCTGCGCCGTGCGCTGTCAGGACTGCCTGGAAGCTCTGGATGTGCA 

CACATGCAAGGCCCCCAAGAACGCTGACTGGACAACCGCTACATGACCCCAGCAGGCGTCACC 

ATCCACCTTCCCTTCTACAAGGACTCCATTGGATCTGCAAGAACACTGGACCTTTGGGTTCTTTC 

TGGGGGGATATTTCCTAAGGCATGTGGCCTTTATCTCAACGGAAGCCCCCTCTTCCTCCCTGGG 

GGCCCCAGGATGGGGGGCCACACGCTGCACCTAAAGCCTACCCTATTCTATCCATCTCCTGGTG 

TTCTGCAGTCATCTCCCCTCCTGGCGAGTTCTCTTTGGAAATAGCATGACAGGCTGTTCAGCCGG 

GAGGGTGGTGGGCCCAGACCACTGTCTCCACCCACCTTGACGTITCTrCTTTCTAGAGCAGTTG 
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GCCAAGCAGAAAAAAAAGTGTCTCAAAGGAGCTITCrcAATC 
TAAGAAATTCCATACTTCTCTCAGATGGAACA 

AACTTTAACTTTTGAAAAGACCAAGACTTTTGTCTGTACAAGTGGTTTTACAGCT 

G^G^G^TTACCTGGAGAAGAATGGCnTTCAATACC 

TTTTCAAGGCATITATTGCCATATTAAAATCTGATGTAACAAGGTGGGGACGTrc 

ACTATGGTGTGTTGTATCTITGTAAGAGCAAAAGCCTCAGAAAGGGATTGCTTTGCATTACTGT 

C^CTTGATATAAAAAATCTITAGGGAATGAGAGTTCCTTCT 
AAAAAGAAGATGAATGGTCTGGCAATATTCTGTAACTATTGGGTC 

TTAGTGGATGGAATATCAGAAGTATATCTGTACAGATCAAGAAAAAAAGGAAGAATAAAATTC 
CTATATCAT 



Figure 19 

CGGGAGTCTTCGGGGAGCTATGCTGAGACCGGGTGGTGCGGAGGAAGCTGCGCAGCTCCCGCT 

TCGGCGCGCCAGCGCCCCGGTCCCTGTGCCGTCGCCCGCGGCCCCCGACGGCTCCCGGGCTTCG 

GCCCGCCTAGGTCTTGCCTGCCTTCTGCTCCTGCTGCTGCTGACGCTGCCGGCCCGCGTAGACAC 

GTCCTGGTGGTACATTGGGGCACTGGGGGCACGAGTGATCTGTGACAATATCCCTGGTTTGGTG 

AGCCGGCAGCGGCAGCTGTGCCAGCGTTACCCAGACATCATGCGTTCAGTGGGCGAGGGTGCC 

CGAGAATGGATCCGAGAGTGTCAGCACCAATTCCGCCACCACCGCTGGAACTGTACCACCCTG 

GACCGGGACCACACCGTCTTTGGCCGTGTCATGCTCAGAAGTAGCCGAGAGGCAGCTTTTGTAT 

ATGCCATCTCATCAGCAGGGGTAGTCCACGCTATTACTCGCGCCTGTAGCCAGGGTGAACTGAG 

TGTGTGCAGCTGTGACCCCTACACCCGTGGCCGACACCATGACCAGCGTGGGGACTTTGACTGG 

GGTGGCTGCAGTGACAACATCCACTACGGTGTCCGTTTTGCCAAGGCCTrCGTGGATGCCAAGG 

AGAAGAGGCTTAAGGATGCCCGGGCCCTCATGAACTTACATAATAACCGCTGTGGTCGCACGG 

CTGTGCGGCGGTTTCTGAAGCTGGAGTGTAAGTGCCATGGCGTGAGTGGTTCCTGTACTCTGCG 

CACCTGCTGGCGTGCACTCTCAGATTTCCGCCGCACAGGTGATTACCTGCGGCGACGCTATGAT 

GGGGCTGTGCAGGTGATGGCCACCCAAGATGGTGCCAACTTCACCGCAGCCCGCCAAGGCTAT 

CGCCGTGCCACCCGGACTGATCITGTCTACTTTGACAACTCTCCAGATTACTGTGTCTTGGACAA 

GGCTGCAGGTTCCCTAGGCACTGCAGGCCGTGTCTGCAGCAAGACATCAAAAGGAACAGACGG 

TTGTGAAATCATGTGCTGTGGCCGAGGGTACGACACAACTCGAGTCACCCGTGTTACCCAGTGT 

GAGTGCAAATTCCACTGGTGCTGTGCTGTACGGTGCAAGGAATGCAGAAATACTGTGGACGTC 

CATACTTGCAAAGCCCCCAAGAAGGCAGAGTGGCTGGACCAGACCTGAACACACAGATACCTC 

ACTCATCCCTCCAATTCAAGCCTCTCAACTCAAAAGCACAAGATCCTTGCATGCACACCTTCCT 

CCACCCTCCACCCTGGGCTGCTACCGCTTCTATTTAAGGATGTAGAGAGTAATCCATAGGGACC 

ATGGTGTCCTGGCTGGTTCCTrAGCCCTGGGAAGGAGTTGTCAGGGGATATAAGAAACTGTGCA 

AGCTCCCTGATTTCCCGCTCTGGAGATTTGAAGGGAGAGTAGAAGAGATAGGGGGTCTTTAGA 

GTGAAATGAGTTGCACTAAAGTACGTAGTTGAGGCTCCITTTrTCTTTCC^ 

CGACACTTCITGGTGTGCAAGAGGAAGGGTACCTGTAGAGAGCTTCTTTTTGTTTCT 

CAAAG1TAGATGGGACAAAGATGAATGGCATGTCCCTTCTCTGAAGTCCGTTTGAGCAGAACTA 

CCTGGTACCCCGAAAGAAAAATCTTAGGCTACCACATTCTATTATTGAGAGCCTGAGATGTTAG 

CCATAGTGGACAAGGTTCCATTCACATGCTCATATGTTTATAAACTGTGTTTTGTAGAAGAAAA 

AGAATCATAACAATACAAACACACATTCATTCTCTCnTITrCTCTCTACCATTCT 

TGGACAGCACTGCCTCTTTTGCTTACTTGCTGCCTGTTCAAACTGAGGTGGAATGCAGTGGTTCC 

CATGCTTAACAGATCATTAAAACACCCTAGAACACTCCTAGGATAGATTAATGT 



Figure 20 

GCGCTTCTGACAAGCCCGAAAGTCATTTCCAATCTCAAG 

CGTCGCTCCCCCTCYTCATGGTCGCGGGCAAACTTCCTCCTCGGCGCCTCTTCTAATGGAGCCCC 
ACCTGCTCGGGCTGCTCCTCGGCCTCCTGCTCGGTGGCACCAGGGTCCTCGCTGGCTACCCAAT 
TTGGTGGTCCCTGGCCCTGGGCCAGCAGTACACATCTCTGGGCTCACAGCCCCTGCTCTGCGGC 
TCCATCCCAGGCCTGGTCCCCAAGCAACTGCGCTTCTGCCGCAATTACATCGAGATCATGCCCG 
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CGTGGCCGAGGGCGTGAAGCTGGGCATCCAGGAGTGCCAGCACCAGTTCCGGGGCCGCCGCT 

GGAACTGCACCACCATAGATGACAGCCTGGCCATCTTTGGGCCCGTCCTCGACAAAGCCACCCG 

CGAGTCGGCCTTCGTTCACGCCATCGCCTCGGCCGGCGTGGCCTTCGCCGTCACCCGCTCCTGC 

GCCGAGGGCACCTCCACCATTTGCGGCTGTGACTCGCATCATAAGGGGCCGCCTGGCGAAGGC 

TGGAAGTGGGGCGGCTGCAGCGAGGACGCTGACTTCGGCGTGTTAGTGTCCAGGGAGTTCGCG 

GATGCGCGCGAGAACAGGCCGGACGCGCGCTCGGCCATGAACAAGCACAACAACGAGGCGGG 

CCGCACGACTATCCTGGACCACATGCACCTCAAATGCAAGTGCCACGGGCTGTCGGGCAGCTGT 

GAGGTGAAGACCTGCTGGTGGGCGCAGCCTGACTTCCGTGCCATCGGTGACTTCCTCAAGGACA 

AGTATGACAGCGCCTCGGAGATGGTAGTAGAGAAGCACCGTGAGTCCCGAGGCTGGGTGGAGA 

CCCTCCGGGCCAAGTACTCGCTCTTCAAGCCACCCACGGAGAGGGACCTGGTCTACTACGAGA 

ACTCCCCCAACTTTTGTGAGCCCAACCCAGAGACGGGTTCCTTTGGCACAAGGGACCGGACTTG 

CAATGTCACCTCCCACGGCATCGATGGCTGCGATCTGCTCTGCTGTGGCCGGGGCCACAACACG 

AGGACGGAGAAGCGGAAGGAAAAATGCCACTGCATCTTCCACTGGTGCTGCTACGTCAGCTGC 

CAGGAGTGTATTCGCATCTACGACGTGCACACCTGCAAGTAGGGCACCAG 



Figure 21 

ATGAGTCCCCGCTCGTGCCTGCGTTCGCTGCGCCTCCTCGTCTTCGCCGTCTTCTCAGCCGCCGC 

GAGCAACTGGCTGTACCTGGCCAAGCTGTCGTCGGTGGGGAGCATCTCAGAGGAGGAGACGTG 

CGAGAAACTCAAGGGCCTGATCCAGAGGCAGGTGCAGATGTGCAAGCGGAACCTGGAAGTCAT 

GGACTCGGTGCGCCGCGGTGCCCAGCTGGCCATTGAGGAGTGCCAGTACCAGTTCCGGAACCG 

GCGCTGGAACTGCTCCACACTCGACTCCTTGCCCGTCTTCGGCAAGGTGGTGACGCAAGGGATT 

CGGGAGGCGGCCTTGGTGTACGCCATCTCTTCGGCAGGTGTGGCCTTTGCAGTGACGCGGGCGT 

GCAGCAGTGGGGAGCTGGAGAAGTGCGGCTGTGACAGGACAGTGCATGGGGTCAGCCCACAG 

GGCrTCCAGTGGTCAGGATGCTCTGACAACATCGCCTACGGTGTGGCCTTCTCACAGTCGTrTG 

TGGATGTGCGGGAGAGAAGCAAGGGGGCCTCGTCCAGCAGAGCCCTCATGAACCTCCACAACA 

ATGAGGCCGGCAGGAAGGCCATCCTGACACACATGCGGGTGGAATGCAAGTGCCACGGGGTGT 

CAGGCTCCTGTGAGGTAAAGACGTGCTGGCGAGCCGTGCCGCCCTTCCGCCAGGTGGGTCACG 

CACTGAAGGAGAAGTTTGATGGTGCCACTGAGGTGGAGCCACGCCGCGTGGGCTCCTCCAGGG 

CACTGGTGCCACGCAACGCACAGTTCAAGCCGCACACAGATGAGGACTTGGTGTACTTGGAGC 

CTAGCCCCGACTTCTGTGAGCAGGACATGCGCAGCGGCGTGCTGGGCACGAGGGGCCGCACAT 

GCAACAAGACGTCCAAGGCCATCGACGGCTGTGAGCTGCTGTGCTGTGGCCGCGGCTTCCACA 

CGGCGCAGGTGGAGCTGGCTGAACGCTGCAGCTGCAAATTCCACTGGTGCTGCTrCGTCAAGTG 

CCGGCAGTGCCAGCGGCTCGTGGAGTTGCACACGTGCCGATGA 



Figure 22 

ATTAATTCTGGCTCCACTTGTTGCTCGGCCCAGGTTGGGGAGAGGACGGAGGGTGGCCGCAGC 

GGGTTCCTGAGTGAATTACCCAGGAGGGACTGAGCACAGCACCAACTAGAGAGGGGTCAGGGG 

GTGCGGGACTCGAGCGAGCAGGAAGGAGGCAGCGCCrGGCACCAGGGCTTTGACTCAACAGA 

ATTGAGACACGTTTGTAATCGCTGGCGTGCCCCGCGCACAGGATCCCAGCGAAAATCAGATTTC 

CTGGTGAGGTTGCGTGGGTGGATTAATTTGGAAAAAGAAACTGCCTATATCTTGCCATCAAAAA 

ACTCACGGAGGAGAAGCGCAGTCAATCAACAGTAAACTTAAGAGACCCCCGATGCTCCCCTGG 

TTTAACTTGTATGCTTGAAAATTATCTGAGAGGGAATAAACATCTTTTCCTTCTTCCCTCT^ 

AAGTCCATTGGAATATTAAGCCCAGGAGTTGCTITGGGGATGGCrGGAAGTGCAATGTCTTCCA 

AGTTCTTCCTAGTGGCnTTGGCCATATTTTTCTCCTTCGCCCAGGTTGTAATTG 

GGTGGTCGCTAGGTATGAATAACCCTGTTCAGATGTCAGAAGTATATATTATAGGAGCACAGCC 

TCTCTGCAGCCAACTGGCAGGACTTTCTCAAGGACAGAAGAAACTGTGCCACTTGTATCAGGAC 

CACATGCAGTACATCGGAGAAGGCGCGAAGACAGGCATCAAAGAATGCCAGTATCAATTCCGA 

CATCGACGGTGGAACTGCAGCACTGTGGATAACACCTCTGTTTTTGGCAGGGTGATGCAGATAG 

GCAGCCGCGAGACGGCCTTCACATACGCCGTGAGCGCAGCAGGGGTGGTGAACGCCATGAGCC 

GGGCGTGCCGCGAGGGCGAGCTGTCCACCTGCGGCTGCAGCCGCGCCGCGCGCCCCAAGGACC 
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TGCCGCGGGACTGGCTCTGGGGCGGCTGCGGCGACAACATCGACTATGGCTACCGCTTTGCCAA 

GGAGTTCGTGGACGCCCGCGAGCGGGAGCGCATCCACGCCAAGGGCTCCTACGAGAGTGCTCG 

CATCCTCATGAACCTGCACAACAACGAGGCCGGCCGCAGGACGGTGTACAACCTGGCTGATGT 

GGCCTGCAAGTGCCATGGGGTGTCCGGCTCATGTAGCCTGAAGACATGCTGGCTGCAGCTGGC 

AGACTTCCGCAAGGTGGGTGATGCCCTGAAGGAGAAGTACGACAGCGCGGCGGCCATGCGGCT 

CAACAGCCGGGGCAAGTTGGTACAGGTCAACAGCCGCTTCAACTCGCCCACCACACAAGACCT 

GGTCTACATCGACCCCAGCCCTGACTACTGCGTGCGCAATGAGAGCACCGGCTCGCTGGGCAC 

GCAGGGCCGCCTGTGCAACAAGACGTCGGAGGGCATGGATGGCTGCGAGCTCATGTGCTGCGG 

CCGTGGGTACGACCAGTTCAAGACCGTGCAGACGGAGCGCTGCCACTGCAAGTTCCACTGGTG 

CTGCTACGTCAAGTGCAAGAAGTGCACGGAGATCGTGGACCAGTTTGTGTGCAAGTAGTGGGT 

nnCACCCAGCACTCAGCCCCGCTCCCAGGACCCGCTTATTTATAGAAAGTACAGTGATTCTGGT 

TTTTGGTTTTTAGAAATATTTTTTATTTTTCCCCAAGAA^ 

TTACCATCTAAGAACTCTGTGGTTTATTATTAATATTATAATTATTATTTGGCAATAATGG 

GGGAACCACGAAAAATATTTATTTTGTGGATCTTTGAAAAGGTAATACAAGACTTCTTTTGGAT 

AGTATAGAATGAAGGGGGAAATAACACATACCCTAACTTAGCTGTGTGGGACATGGTACACAT 

CCAGAAGGTAAAGAAATACATTTTClTmCTCAAATATGCCATCATATGGGATGGGTAGGTTC 

CAGTTGAAAGAGGGTGGTAGAAATCTATrCACAATTCAGCTTCTATGACCAAAATGAGTTGTAA 

ATTCTCTGGTGCAAGATAAAAGGTCTTGGGAAAACAAAACAAAACAAAACAAACCTCCCTTCC 

CCAGCAGGGCTGCTAGCTTGCTTTCTGCATTTTCAAAATGATAATTTACAATGGAAGGACAAGA 

ATGTCATATTCTCAAGGAAAAAAGGTATATCACATGTCTCATTCTCCTCAAATATTCCATTTGCA 

GACAGACCGTCATATTCTAATAGCTCATGAAATTTGGGCAGCAGGGAGGAAAGTCCCCAGAAA 

TTAAAAAATTTAAAACTCTTATGTCAAGATGTTGATTTGAAGCTGTTATAAGAATTGGGATTCC 

AGATITGTAAAAAGACCCCCAATGArrCTGGACACTAGATTTiTTGTITGGGGAGGTTGGCTTG 

AACATAAATGAAATATCCTGTATTTrCTrAGGGATACTTGGTTAGTAAATTATAATAGTAGAAA 

TAATACATGAATCCCATTCACAGGTTTCTCAGCCCAAGCAACAAGGTAATTGCGTGCCATTCAG 

CACTGCACCAGAGCAGACAACCTATTTGAGGAAAAACAGTGAAATCCACCTTCCTCTTCACACT 

GAGCCCrCTCTGATTCCTCCGTGTTGTGATGTGATGCTGGCCACGTTTCCAAACGGCAGCTCCAC 

TGGGTCCCCTTTGGTTGTAGGACAGGAAATGAAACATTAGGAGCTCTGCTTGGAAAACAGTTCA 

CTACTTAGGGATTTTTGTTTCCTAAAACTTTTATTTTGAGGAGCAGT^ 

ACAGAACTTGGCTAATGGAATTCACAGAGGTGTTGCAGCGTATCACTGTTATGATCCTGTGTTT 

AGATTATCCACrCATGCTTCTCCTATTGTACTGCAGGTGTACCTTAAAACTGTTCCCAGTGTACT 

TGAACAGTTGCATTTATAAGGGGGGAAATGTGGTTTAATGGTGCCTGATATCTCAAAGTCTITr 

GTACATAACATATATATATATATACATATATATAAATATAAATATAAATATATCTCATTGCAGC 

CAGTGATTTAGATTTACAGCTrACTCTGGGGTTATCTCTCTGTCTAGAGCATTGTTGTCCTTCAC 

TGCAGTCCAGTTGGGATTATTCCAAAAGTTITTTGAGTCTTGAGCTTGGGCTGTGGCCCCGCTGT 

GATCATACCCTGAGCACGACGAAGCAACCTCGTTTCTGAGGAAGAAGCTTGAGTTCTGACTCAC 

TGAAATGCGTGTTGGGTTGAAGATATCTTITITrCTTTTCTGCCTCACCCCT^ 
CATITCTGTTCACTITGTGGAGAGGGCATTACTTGTTCGTTATAGACATGGACGTTAAGAGATAT 
TCAAAACTCAGAAGCATCAGCAATGTTTCTCTrTTCTTAGTTCATTCTGCAGAATGGAAACCCAT 
GCCTATTAGAAATGACAGTACTTATTAATTGAGTCCCTAAGGAATATTCAGCCCACTACATAGA 

TAGCTITITI TIT ITI TITIT^ 

TCrrATCTCAGACTTACGTTGTmAAAAGTTTGGAAAGATACACATCTITrCATACCCCCCCTT 

AGGAGGTTGGGCTTTCATATCACCTCAGCCAACTGTGGCTCTTAATTTATTGCATAATGATATCC 

ACATCAGCCAACrGTGGCTCTTTAATTTATTGCATAATGATATTCACATCCCCTCAGTTGCAGTG 

AArrGTGAGCAAAAGATCTTGAAAGCAAAAAGCACTAATTAGTTTAAAATGTCACITim 

TTTTATTATACAAAAACCATGAAGTACTTTTTTTATTTGCrAAATCA 

CTCATGTITATGAAGAGAGTTGAGTTrAACAATCCTAGCTTTTAAAAGAAACTATTTAATGTAA 
AATATTCTACATGTCATTCAGATATTATGTATATCnTCTAGCCTTTATTCTGTACTTTTAATGTAC 
ATATTTCTGTCTTGCGTGATTTGTATATTTCACTGGTTTAAAAAACAAACATCGA 
CCAAATGGAAG 

Figure 23 

GGCACGAGCGCAGGAGACACAGGCGCTGGCTGCCCCGTCCGCTCTCCGCCTCCGCCGCGCCCTCCTCGCC 
CGGG ATGGGCCCCCCCGCCGCCGCCGGATCCCTCGCCTCCCGGCCGCCGCCGTTGCGCTCGCCGCGCTCG 
CACTGAAGCCCGGGCCCTCGCGCGCCGCGGTTCGCCCCGCAGCCTCGCCCCCTGCCCACCCGGGCGGCCG 
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TAGGG CGGTC ACG ATGCTGCCGCCCTTACCCTCCCGCCTCGGGCTGCTGCTGCTGCTGCTCCTGTGCCCG 

GCGCACGTCGGCGGACTGTGGTGGGCTGTGGGCAGCCCCTTGGTTATGGACCCTACCAGCATCTGCAGGA 

AGGCACGGCGGCTGGCCGGGCGGCAGGCCGAGTTGTGCCAGGCTGAGCCGGAAGTGGTGGCAGAGCTAGC 

TCGGGGCGCCCGGCTCGGGGTGCGAGAGTGCCAGTTCCAGTTCCGCTTCCGCCGCTGGAATTGCTCCAGC 

CACAGCAAGGCCTTTGGACGCATCCTGCAACAGGACATTCGGGAGACGGCCTTCGTGTTCGCCATCACTG 

CGGCCGGCGCCAGCCACGCCGTCACGCAGGCCTGTTCTATGGGCGAGCTGCTGCAGTGCGGCTGCCAGGC 

GCCCCGCGGGCGGGCCCCTCCCCGGCCCTCCGGCCTGCCCGGCACCCCCGGACCCCCTGGCCCCGCGGGC 

TCCCCGGAAGGCAGCGCCGCCTGGGAGTGGGGAGGCTGCGGCGACGACGTGGACTTCGGGGACGAGAAGT 

CGAGGCTCTTTATGGACGCGCGGCACAAGCGGGGACGCGGAGACATCCGCGCGTTGGTGCAACTGCACAA 

CAACGAGGCGGGCAGGCTGGCCGTGCGGAGCCACACGCGCACCGAGTGCAAATGCCACGGGCTGTCGGGA 

TCATGCGCGCTGCGCACCTGCTGGCAGAAGCTGCCTCCATTTCGCGAGGTGGGCGCGCGGCTGCTGGAGC 

GCTTCCACGGCGCCTCACGCGTCATGGGCACCAACGACGGCAAGGCCCTGCTGCCCGCCGTCCGCACGCT 

CAAGCCGCCGGGCCGAGCGGACCTCCTCTACGCCGCCGATTCGCCCGACTTTTGCGCCCCCAACCGACGC 

ACCGGCTCCCCCGGCACGCGCGGTCGCGCCTGCAATAGCAGCGCCCCGGACCTCAGCGGCTGCGACCTGC 

TGTGCTGCGGCCGCGGGCACCGCCAGGAGAGCGTGCAGCTCGAAGAGAACTGCCTGTGCCGCTTCCACTG 

GTGCTGCGTAGTACAGTGCCACCGTTGCCGTGTGCGCAAGGAGCTCAGCCTCTGCCTGTGACCCGCCGCC 

CGGCCGCTAGACTGACTTCGCGCAGCGGTGGCTCGCACCTGTGGGACCTCAGGGCACCGGCACCGGGCGC 

CTCTCGCCGCTCGAGCCCAGCCTCTCCCTGCCAAAGCCCAACTCCCAGGGCTCTGGAAATGGTGAGGCGA 

GGGGCTTGAGAGGAACGCCCACCCACGAAGGCCCAGGGCGCCAGACGGCCCCGAAAAGGCGCTCGGGGAG 

CGTTTAAAGGACACTGTACAGGCCCTCCCTCCCCTTGGCCTCTAGGAGGAAACAGTTTTTTAGACTGGAA 

AAAAGCCAGTCTAAAGGCCTCTGGATACTGGGCTCCCCAGAACTGCTGGCCACAGGATGGTGGGTGAGGT 

TAGTATCAATAAAGATATTTAAACCAAAAAAAAAAAAAAAAAAAAA 



Figure 24 

CACGCGTCCGGGCCAATCGGGACTATGAACCGGAAAGCGCTGCGCTGCCTGGGCCACCTCTTTC 

TCAGCCTGGGCATGGTCTGCCTCCGGATCGGTGGCTTCTCCTCAGTGGTAGCTCTGGGCGCAAC 

GATCATCTGTAACAAGATCCCAGGCCTGGCTCCCAGACAGCGGGCGATCTGCCAGAGCCGGCC 

CGACGCCATCATCGTCATAGGAGAAGGCTCACAAATGGGCCTGGACGAGTGTCAGTTTCAGTTC 

CGCAATGGCCGCTGGAACTGCTCTGCACTGGGAGAGCGCACCGTCTTCGGGAAGGAGCTCAAA. 

GTGGGGAGCCGGGACGGTGCGTTCACCTACGCCATCATTGCCGCCGGCGTGGCCCACGCCATC 

ACAGCTGCCTGTACCCATGGCAACCTGAGCGACTGTGGCTGCGACAAAGAGAAGCAAGGCCAG 

TACCACCGGGACGAGGGCTGGAAGTGGGGTGGCTGCTCTGCCGACATCCGCTACGGCATCGGC 

TrCGCCAAGGTCTTTGTGGATGCCCGGGAGATCAAGCAGAATGCCCGGACTCTCATGAACTTGC 

ACAACAACGAGGCAGGCCGAAAGATCCTGGAGGAGAACATGAAGCTGGAATGTAAGTGCCAC 

GGCGTGTCAGGCTCGTGCACCACCAAGACGTGCTGGACCACACTGCCACAGTTTCGGGAGCTG 

GGCTACGTGCTCAAGGACAAGTACAACGAGGCCGTTCACGTGGAGCCTGTGCGTGCCAGCCGC 

aacaagcggcccaccttcctGaagatcaagaagccactgtcgtaccgcaagcccatggacacg 

GACCTGGTGTACATCGAGAAGTCGCCCAACTACTGCGAGGAGGACCCGGTGACCGGCAGTGTG 

GGCACCCAGGGCCGCGCCTGCAACAAGACGGCTCCCCAGGCCAGCGGCTGTGACCTCA^ 

TGTGGGCGTGGCTACAACACCCACCAGTACGCCCGCGTGTGGCAGTGCAACTGTAAGTTCCACT 

GGTGCTGCTATGTCAAGTGCAACACGTGCAGCGAGCGCACGGAGATGTACACGTGCAAGTGAG 

CCCCGTGTGCACACCACCCTCCCGCTGCAAGTCAGATTGCTGGGAGGACTGGACCGTTOCAAG 

CTGCGGGCTCCCTGGCAGGATGCTXjAGCTTGTCTI^ 

TCCTGCAGGCATCCGTGGGGGAAAAAAAATCTCTCAGAACCCTCAACTATTCTGTTCCACACCC 
AATGCTGCTCCACCCTCCCCCAGACACAGCCCAAGTCCCTCCGCGGCTGGAGCGAAGCCTTCTG 
CAGCAGGAACTCTGGACCCCTGGGCCTCATCACAGCAATATTTAACAATTTATTCTGATAAAAA 

TAATATTAATTTATTTAATTAAAAAGA^ 
AAAAGGGGGG 
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TCCGCTTACACACCAAGGAAAGTTGGGCTTTGAAGAATTCCATCCCCATGGCCACTGGAGGAA 

GAATAmCNCCCGTCTTGCTTACCCATCT 
AGAGGATTATGTTTCTTTCAAAGCCTTCTGTGTACATCTC 

AGCCACAGCTGGTCGGTGAACAATITCCTGATGACTGGTCCAAAGGCTTACCTGATTTACTCCA 

GCAGTGTGGCAGCTGGTGCCCAGAGTGGTATTGAAGAATGCAAGTATCAGTTTGCCTGGGACC 

GCTGGAACTGCCCTGAGAGAGCCCTGCAGCTGTCCAGCCATGGTGGGCTTCGCAGTGCCAATCG 

GGAGACAGCATTTGTGCATGCCATCAGTTCTGCTGGAGTCATGTACACCCTGACTAGAAACTGC 

AGCCTTGGAGATTTTGATAACTGTGGCTGTGATGACTCCCGCAACGGGCAACTGGGGGGACAA 

GGCTGGCTGTGGGGAGGCTGCAGTGACAATGTGGGCTTCGGAGAGGCGATTTCCAAGCAGTTT 

GTCGATGCCCTGGAAACAGGACAGGATGCACGGGCAGCCATGAACCTGCACAACAACGAGGCT 

GGCCGCAAGGCGGTGAAGGGCACCATGAAACGCACGTGTAAGTGCCATGGCGTGTCTGGCAGC 

TGCACCACGCAGACCTGTTGGCTGCAGCTGCCCGAGTTCCGCGAGGTGGGCGCGCACCTGAAG 

GAGAAGTACCACGCAGCACTCAAGGTGGACCTGCTGCAGGGTGCTGGCAACAGCGCGGCCGCC 

CGCGGCGCCATCGCCGACACCTTTCGCTCCATCrCTACCCGGGAGCTGGTGCACCTGGAGGACT 

CCCCGGACTACTGCCTGGAGAACAAAACGCTAGGGCTGCTGGGCACCGAAGGCCGAGAGTGCC 

TAAGGCGCGGGCGGGCCCTGGGTCGCTGGGAACTCCGCAGCTGCCGCCGGCTCTGCGGGGACT 

GCGGGCTGGCGGTGGAGGAGCGCCGGGCCGAGACCGTGTCCAGCTGCAACTGCAAGTTCCACT 

GGTGCTGTGCAGTCCGCTGCGAGCAGTGCCGCCGGAGGGTCACCAAGTACTTCTGTAGCCGCGC 

AGAGCGGCCGCGGGGGGGCGCTGCGCACAAACCCGGGAGAAAACCCTAAGGGTTTCCTCTGCC 

CCCTCCTTTTCCCACTGGTTCTTGGCTTCCTITAGAGACCCCGGTAATTGTGGAACCTAGGGAAT 

GGGGAACCCGCTCTCCCAGACCTAGGGATCCTGAAAGGGAAAAACTGCAATTTCTCCAAAGCT 

TGCCACTTTCCAGCCTGTTTCCCCAATTCCTCTGTGCTCTCCrAAAGCTCTGTCTGAATCCTCGC 

AGCCACACCTAGGTCTGAAAACTCAGGCTTTGAGTTACTGATCTrCCTrGGATTAGGAAAACAG 

GTGTTCCTCCTCCCCTCTCCTATCAGCCCTAATCTCTGACCTAGCCTATCAACCCTTAGGCGCTG 

GAAAAACCTTCrCATACACGCAGGACCCAGGTTAACTCAAAGCTTTGCCcmTrGCCCACTGTC 

TGCrACCAGGGGCTCACCCTCTGCTGCACCTCTCTTCTGCACAGCTCCTCCCCTGCTACTGCTGA 

CCAAATTCCCAGGAATCTTGAATGCTTTCTCTCCTCTTCrCCCTTTC 

AGGAAACTGGCCCCGGAAAAGCATGTCTTTGGGGTTGGTTCCTAGAGGCAGAGGTTGAAGATG 

GAAGAGGGAGCTCTGGAGTGCTAACTTGAACACCAAGGGTGCTACTCATCCCTATGGTATCATA 

TCATGAATGGACTTTACTAGTGGGGCAATGACTTTCCTAGACAATAACCCGAGGGACTCCAGAT 

ACATACCCCGAAGGTCTAGGAAATACGTTAAGGGCAGATTACAGTCATTTCCTACCCTTTAAAG 

GTAACTTCTCCCTTCTCCTGACCTACTTCCTCCT^ 

ATCTITGTTCCTCTGAGCCAAGACTGAGGTAAATAAAGCCACTTTCCTCITCAGATCCTGGTCTG 
CACCTCTAGA 



Figure 26 

GCGGCCGCGTCGACGGAGGGGCTGCAGCTCCGTCAGCCCGGCAGAGCCACCCTGAGCTCGGTG 

AGAGCAAAGCCAGAGCCCCCAGTCCTTTGCTCGCCGGCTTGCTATCTCTCTCGATCACTCCCTCC 

CTTCCTCCCTCCCTTCCTCCCGGCGGCCGCGGCGGCGCTGGGGAAGCGGTGAAGAGGAGTGGCC 

CGGCCCTGGAAGAATGCGGCrCTGACAAGGGGACAGAACCCAGCGCAGTCTCCCCACGGTTTA 

AGCAGCACTAGTGAAGCCCAGGCAACCCAACCGTGCCTGTCTCGGACCCCGCACCCAAACCAC 

TGGAGGTCCTGATCGATCTGCCCACCGGAGCCTCCGGGCTTCGACATGCTGGAGGAGCCCCGGC 

CGCGGCCTCCGCCCTCGGGCCTCGCGGGTCTCCTGTTCCTGGCGTTGTGCAGTCGGGCTCTAAG 

CAATGAGATTCTGGGCCTGAAGTTGCCTGGCGAGCCGCCGCTGACGGCCAACACCGTGTGCTTG 

ACGCTGTCCGGCCTGAGCAAGCGGCAGCTAGACCTGTGCCTGCGCAACCCCGACGTGACGGCG 

TCCGCGCTTCAGGGTCTGCACATCGCGGTCCACGAGTGTCAGCACCAGCTGCGCGACCAGCGCT 

GGAACTGCTCCGCGCTTGAGGGCGGCGGCCGCCTGCCGCACCACAGCGCCATCCTCAAGCGCG 

GTTTCCGAGAAAGTGCTTTTTCCTTCTCCATGCTGGCTGCTGGGGTCATGCACGCAGTAGCCAC 

GGCCTGCAGCCTGGGCAAGCTGGTGAGCTGTGGCTGTGGCTGGAAGGGCAGTGGTGAGCAGGA 

TCGGCTGAGGGCCAAACTGCTGCAGCTGCAGGCACTGTCCCGAGGCAAGAGTTTCCCCCACTCT 

CTGCCCAGCCCTGGCCCTGGCTCAAGCCCCAGCCCTGGCCCCCAGGACACATGGGAATGGGGT 

GGCTGTAACCATGACATGGACTTTGGAGAGAAGTTCTCTCGGGATTTCTTGGATTCCAGGGAAG 

CTCCCCGGGACATCCAGGCACGAATGCGAATCCACAACAACAGGGTGGGGCGCCAGGTGGTAA 

CTGAAAACCTGAAGCGGAAATGCAAGTGTCATGGCACATCAGGCAGCTGCCAGTTCAAGACAT 



WO 03/012082 



PCT/GB02/03409 



22/41 

AGCCTGTGCTGTGGCOTTGGGCACAACGTGCTCCGGCAGACACGAGTTGAGCGCTG 
G^CACTGGTGCTGCTATGTGCTGTGTGATGAGTGCAAGG 

?aa^tg!?got^ 

GAAACAGCmAGGGGTGGTGGGGGTCAGGTGGACTCTGGGATGTGTAGCCTTCTCCCCAACA 

atogIgggtotgaggggaagctgccacccctcttctgct^ 

GATGAAATGCACTGTATTGCTCCTCCCACTrCT 

CTGTTTGGAGCCATAAGATCTGTATCTAGAAAGAGATCACCCACTCCTATGTACT 
CTC(mTACTGCAGCCTGGGCTCCCTCTTGTGGGATAATGGGAGACAGTGGTAGAGAGGTTnT 

GGCCCTTAGGGAAGTTGTCTCCTTCCATTCAGATGTTAATGGGGACCCTCCAAAGGAAGGGGTT 
TTCCCATGACTCTTGGAGCCTCTTTTTCCTTCTTC^ 

ATACrGAGACTrGTTCTTGGTTCCTGTTTGAAACTAAAATAAATTAAGTTACTGGAAAAAAA^ 
AAAAAAAAA 



Figure 27 

TAACCCGCCGCCTCCGCTCTCCCCGGCTGCAGGCGGCGTGCAGGACCAGCGGCGGCCGTGCAG 

GCGGAGGACTTCGGCGCGGCTCCTCCTGGGTGTGACCCCGGGCGCGCCCGCCGCGCGACGATG 

AGGGCGCGGCCGCAGGTCTGCGAGGCGCTGCTCTTCGCCCTGGCGCTCCAGACCGGCGTGTGCT 

ATGGCATCAAGTGGCTGGCGCTGTCCAAGACACCATCGGCCCTGGCACTGAACCAGACGCAAC 

ACTGCAAGCAGCTGGAGGGTCTGGTGTCTGCACAGGTGCAGCTGTGCCGCAGCAACCTGGAGC 

TCATGCACACGGTGGTGCACGCCGCCCGCGAGGTCATGAAGGCCTGTCGCCGGGCCTTTGCCGA 

CATGCGCTGGAACTGCTCCTCCATTGAGCTCGCCCCCAACTATTTGCTTGACCTGGAGAGAGGG 

ACCCGGGAGTCGGCCTTCGTGTATGCGCTGTCGGCCGCCACCATCAGCCACGCCATCGCCCGGG 

CCTGCACCTCCGGCGACCTGCCCGGCTGCTCCTGCGGCCCCGTCCCAGGTGAGCCACCCGGGCC 

CGGGAACCGCTGGGGAAGATGTGCGGACAACCTCAGCTACGGGCTCCTCATGGGGGCCAAGTT 

TTCCGATGCTCCTATGAAGGTGAAAAAAACAGGATCCCAAGCCAATAAACTGATGCGTCTACA 

CAACAGTGAAGTGGGGAGACAGGCTCTGCGCGCCTCTCTGGAAATGAAGTGTAAGTGCCATGG 

GGTGTCTGGCTCCTGCTCCATCCGCACCTGCTGGAAGGGGCTGCAGGAGCTGCAGGATGTGGCT 

GCTGACCTCAAGACCCGATACCTGTCGGCCACCAAGGTAGTGCACCGACCCATGGGCACCCGC 

AAGCACCTGGTGCCCAAGGACCTGGATATCCGGCCTGTGAAGGACTGGGAACTTGTTTATTTGC 

AGAGCTCACCTGACTTTTGCATGAAGAATGAGAAGGTGGGCTCCCACGGGACACAAGACAGGC 

AGTGCAACAAGACTTCCAACGGAAGCGACAGCTGCGACCTTATGTGCTGCGGGCGTGGCTACA 

ACCCCTACACAGACCGCGTGGTCGAGCGGTGCCACTGTAAGTACCACTGGTGTTGCTACGTCAC 

CTGCCGCAGGTGTGAGCGTACCGTGGAGCGCTATGTCTGCAAGTGAGGCCCTGCCCTCCGCCCC 

ACGCAGGAGCGAGGACTTTGCTCAAGGACCCTCAGCAACTGGGGCCGGGGGCCTGGAGACACT 

CCATGGAGCTCTGCTTGTGAATTCCAGATGCCAGGCATGGGAGGCGGCTTGTGCTTTGCCTTCA 

CTTGGAAGCCACCAGGAACAGAAGGTCTGGCCACCCTGGAAGGAGNGCAGGACATCAAAGGA 

AACCGACAAGATTAAAAATAACTTGGCAGCCTGAGOTCTGGAGTGCCCACAGNOTGGTGT 

GAGCGGGGCTTGGGATCGGTGAGACTGATACAGACTTGACCTTTCAGGGCCACAGAGACCAGC 

CTCCGGGAAGGGGTCTGCCCGCCTTCTTCAGAATGTTCTGCGGGACCCCCTGGCCCACCCTGGG 

GTCTGAGCCrGCTGGGCCACCACATGGAATCACTAGCTTCGGGTTGTAAATGTmCTTTTGTTT 

NTTGCTTITTCTTCCTTTGGGATGTTGGAAGCTACAGAAATATTTATAAA^ 
TGGGGTGGCACTTCTCAATrCCTCTTTATATATTTTANATATATAAATATATATGTATATATATA 
ATGATCTCTAATNTAAAACTAGCTTTTTAA.GCAGCTGTATGAAATAAATGCTGAGTGAGCCCCA. 
GCCCGCCCCTGCAGTTCCCGGCCTCGTCAAGTGAACTCGGCAGACCCTGGGGCTGGCAGAGGG 

AGCTCTCCAGTTTCCGGGCA 
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GGCGCGGCAAGATGCTGGATGGGTCCCCGCTGGCGCGCTGGCTGGCCGCGGCCTTCGGGCTGA 

CGCTGCTGCTCGCCGCGCTGCGCCCTTCGGCCGCCTACTTCGGGCTGACGGGCAGCGAGCCCCT 

GACCATCCTCCCGCTGACCCTGGAGCCAGAGGCGGCCGCCCAGGCGCACTACAAGGCCTGCGA 

CCGGCTGAAGCTGGAGCGGAAGCAGCGGCGCATGTGCCGCCGGGACCCGGGCGTGGCAGAGA 

CGCTGGTGGAGGCCGTGAGCATGAGTGCGCTCGAGTGCCAGTTCCAGTTCCGCTTTGAGCGCTG 

GAACTGCACGCTGGAGGGCCGCTACCGGGCCAGCCTGCTCAAGCGAGGCTTCAAGGAGACTGC 

CTTCCTCTATGCCATCTCCrCGGCTGGCCTGACGCACGCACTGGCCAAGGCGTGCAGCGCGGGC 

CGCATGGAGCGCTGTACCTGCGATGAGGCACCCGACCTGGAGAACCGTGAGGCCTGGCAGTGG 

GGGGGCTGCGGAGACAACCTTAAGTACAGCAGCAAGTTCGTCAAGGAATTCCTGGGCAGACGG 

TCAAGCAAGGATCTGCGAGCCCGTGTGGACTTCCACAACAACCTCGTGGGTGTGAAGGTGATC 

AAGGCTGGGGTGGAGACCACCTGCAAGTGCCACGGCGTGTCAGGCTCATGCACGGTGCGGACC 

TGCTGGCGGCAGTTGGCGCCTTTCCATGAGGTGGGCAAGCATCTGAAGCACAAGTATGAGACG 

GCACTCAAGGTGGGCAGCACCACCAATGAAGCTGCCGGCGAGGCAGGTGCCATCTCCCCACCA 

CGGGGCCGTGCCTCGGGGGCAGGTGGCAGCGACCCGCTGCCCCGCACTCCAGAGCTGGTGCAC 

CTGGATGACTCGCCTAGCTTCTGCCTGGCTGGCCGCTTCTCCCCGGGCACCGCTGGCCGTAGGT 

GCCACCGTGAGAAGAACTGCGAGAGCATCTGCTGTGGCCGCGGCCATAACACACAGAGCCGGG 

TGGTGACAAGGCCCTGCCAGTGCCAGGTGCGTTGGTGCTGCTATGTGGAGTGCAGGCAGTGCA 

CGCAGCGTGAGGAGGTCTACACCTGCAAGGGCTGAGTTCCCAGGCCCTGCCAGCCCTGCTGCA 

CAGGGTGCAGGCATTGCACACGGTGTGAAGGGTCTACACCTGCACAGGCTGAGTTCCTGGGCT 

CGACCAGCCCAGCTGCGTGGGGTACAGGCATTGCACACAGTGTGAATGGGTCTACACCTGCAT 

GGGCTGAGTCCCTGGGCTCAGACCTAGCAGCGTGGGGTAGTCCCTGGGCTCAGTCCTAGCTGCA 

TGGGGTGCAGGCATTGCACAGAGCATGAATGGGCCTACACCTGCCAAGGCTGAATCCCTGGGC 

CCAGCCAGCCCTGCTGCACATGGCACAGGCATTGCACACGGTGTGAGGAGTGTACACCTGCAA 

GGGCTGAGGCCCTGGGCCCAGTCAGCCCTGCTGCTCAGAGTGCAGGCATTGCACATGGTGTGA 

GAAGGTCTACACCTGCAAGGGACGAGTCCCCGGGCCTGGCCAACCCTGCTGTGCAGGGTGAGG 

GCCATGCATGCTAGTATGAGGGGTCTACACCTGCAAGGACTGAGAGGCTTTT 



Figure 29 

AGCCTGCAAAAACCACAGAGGGCAAAGCCAGAAAGATGGAAAGGCACCCACCCATGCAGCTC 

ACCACTTGCCTCAGGGAGACCCTCTTCACAGGGGCTTCTCAAAAGACCTCCCTATGGTGGTTGG 

GCATTGCCTCCITCGGGGTTCCAGAGAAGCTGGGCTGCGCCAATTTGCCGCTGAACAGCCGCCA 

GAAGGAGCTGTGCAAGAGGAAACCGTACCTGCTGCCGAGCATCCGAGAGGGCGCCCGGCTGGG 

CATTCAGGAGTGCAGGAGCCAGTTCAGACACGAGAGATGGAACTGCATGATCACCGCCGCCGC 

CACTACCGCCCCGATGGGCGCCAGCCCCCTCTTTGGCTACGAGCTGAGCAGCGGCACCAAAGA 

GACAGCATTTATTTATGCTGTGATGGCTGCAGGCCTGGTGCATTCTGTGACCAGGTCATGCAGT 

GCAGGCAACATGACAGAGTGTTCCTGTGACACCACCTTGCAGAACGGCGGCTCAGCAAGTGAA 

GGCTGGCACTGGGGGGGCTGCTCCGATGATGTCCAGTATGGCATGTGGTTCAGCAGAAAGTTCC 

TAGATTTCCCCATCGGAAACACCACGGGCAAAGAAAACAAAGTACTATTAGCAATGAACCTAC 

ATAACAATGAAGCTGGAAGGCAGGCTGTCGCCAAGTTGATGTCAGTAGACTGCCGCTGCCACG 

GAGTITCCGGCTCCTGTGCTGTGAAAACATGCTGGAAAACCATGTCTTCTTTTGAAAAGATTGG 

CCATTTGTTGAAGGATAAATATGAAAACAGTATCCAGATATCAGACAAAATAAAGAGGAAAAT 

GCGCAGGAGAGAAAAAGATCAGAGGAAAATACCAATCCATAAGGATGATCTGCTCTATGTTAA 

TAAGTCTCCCAACTACTGTGTAGAAGATAAGAAACTGGGAATCCCAGGGACACAAGGCAGAGA 

ATGCAACCGTACATCAGAGGGTGCAGATGGCTGCAACCTCCTCTGCTGTGGCCGAGGTTACAAC 

ACCCATGTGGTCAGGCACGTGGAGAGGTGTGAGTGTAAGTTCATCTGGTGCTGCTATGTCCGTT 

GCAGGAGGTGTGAAAGCATGACTGATGTCCACACTTGCAAGTAACCACTCCATCCAGCCTTGG 

GCAAGATGCCTCAGCAATATACAATGGCATTGCAACCAGAGAGGTGCCCATCCCTGTGCAGCG 

CTAGTAAAGTTGACTCTTGCAGTGGAATCCC 
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AGTTGAGGGATTGACACAAATGGTCAGGCGGCGGCGGCGGAGAAGGAGGCGGAGGCGCAGGG 
GGGAGCCGA^CCGCTGGG 

gcgcc^gcca^Sggagccagcgagc 

CCTAGCCACCCGGG^CT^^ 
GGCACCCGCGCAGAGGTCTC 

cccagSgagcggcgS^ 

ACTCCCGGGCC^ 

tcScggaggaSgcagcggggacgccggtggccgccgccgcccgccag^^ 

TGGCGCGCCAGCTGCTGCrGCTGCITroGCTGCT 

ccaggcggcgggccaggggccaggccaggggcccgggccggggcagcaaccgccgccgccgc 

CTCAGCAGCAACAGAGCGGGCAGCAGTACAACGG^^ 
GCTA^GCCAGCCCATCTCCATCCCGCT^^ 

2aacSgctgggccacac^ 

actgSaot^agtgWgctgagctc 

aSg^agagcaggcgct^ 

gaggcgctcatgaacaagttcggcttccagtggccagacacgctcaagtgtgagaagttcccg 
gtgcacggo^ 

SctcctacctcaXctaccac^ 

aggtgtatgggctcatgtacttcgggcccgaggagctgcgcttctcgcgcacctggattggcat 
^ggtcagtgctgt^ 

CGC^CAGCTACCCGGAGCGGCCCATCATCTTCTTGTCCGGCTGTTACACGGCCGTGGCCGTGG 

CCTACATCGCCGGCTTCCTCCTGGAAGACCGAGTGGTGTGTAATGACAAGTTCGCCGAGGACGG 

GGCACGCACTGTGGCGCAGGGCACCAAGAAGGAGGGCTGCACCATCCTOTCATGATGCTCTA 

CTTCTTCAGCATGGCCAGCrCCATCTGGTGGGT^ 

g^tcaTgVggg^ccacgackjccatcgaagcc^ 

CTGTG^CCGGCCATCAAGACCATCACCATCCTGGCGCTGGGCCAGGTGGACGGCGATGTGCTGA 

gcggagtgtgcttcgtggggcttaacaacgtggacgcgctgcgtggcttcgtgctggcgcccct 
otcgtgtacctgt™ 

caccatcatgaagcacgatggcaccaagaccgagaagctggagaagctcatggtgcgca^gg 

CGTCTTCAGCGTGCTGTACACTGTGCCAGCCACCATCGTCATCGCCTGCTACTTCTACGAG 

GCCrTCCGGGACCAGTGGGAACGCAGCTGGGTGGCCCAGAGCTGCAAGAGCTACGC^CCCC 

TGCCCTCACCTCCAGGCGGGCGGAGGCGCCCCGCCGCACCCGCCCATGAGCCCGGACTTCACG 

GTCTTCATGATrAAGTACCTTATGACGCTGATC 

CGGCAAGACCCTCAACTCCTGGAGGAAGTTCTACACGAGGCrCACCAACAGCAAACAAGGGGA 
GACTACAGTCTGAGACCCGGGGCTCAGCCCATGCCCAGGCCTCGGCCGGGGCGCAGCGATCCC 

CCAAAGCCAGCGCCGTGGAGTTCGTGCCAATCCTGACATCTCGAGGTTTC 

CTCTTTCGCAGGCTCCTTTGAACAACTCAGCTCCTGCAAAAGCTTCCGTCCCTGAGGC 

ACACGAGGGCCCGACTGCCAGAGGGAGGATGGACAGACCTCTrGCCCTCACACTCTGGIACCA. 

GGACTGTTCGCTTTTATGATTGTAAATAGCCTGTGTAAGA^ 

ATGACGACCGATCACGCGTTITrCTTTTTCAAAAGTTTTT 

TGAGGCTTITCCITCTTGCCCTTTTCGGAGTATTGCAAAGGAGCTAAAACT 

ACAGCGCTCCTGGTCGTCCTCGCGCGCCTCTCCCTACCACGGGTGCTCGGGACGGCTGGGCGCC 

AGCTCCGGGGCGAGTTCAGCACTGCGGGGTGCGACTAGGGCTGCGCTGCCAGGGTCAOTCCC 

GCCTCCTCClTITGCCCCCTCCCCCTCCITCTGTCCCCTCCCTrrCTTTCCTGGCnTGAG^ 

GCTCTTAAGGTACAGAACTCCACAAACCTTCCAAATCTC 

TCCTCCCTTGCTCGGCGGTGGATTGCGAAGGCCCGTCCCTTCGACTTCCTGAAGCTGGA^ 
ACTGTCCAGAACTTTCCTCCAACTTCATGGGGGCCCACGGGTC 
TCCCTCCACGGTCACCTTCAACGCCCAGACACTCCCTTCTCCCACC^ 
GTGAGATAACCAATGCCAAACTTTTTGAAGTCTAATTTTTGAGGGGTGAGCTCATTTCA^ 

AGTGTCTAAAACCTGGTATGGGTITGGCCAGCGTCATGGA 

AAGAAGCATGAAGCTITGTGTGGGTTGGAAGAGACTGAAGATATGGGTTATAAAATG^AA^ 
CTAATTGCATACGGATGCCTGGCAACCTTGCCTTTGAGAA^^ 

ACCGGTCTGTAAAATGGAAATGTTGAGGTCACCTGGAAAGCTTTGTTAAGGAGTTGATGTTTGC 
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TITCCITAACAAGACAGCAAAACGTAAACAGAAATTGAAAACTTGAAGGATATTTCAGTGTCAT 

GGACTTCCTCAAAATGAAGTGCTATTITCTrATTTITAATCAAATAACTAGACATATATCAGAA 

ACmAAAATGTAAAAGTTGTACACTTTCAACATmATTACGATTATTATTCAGCAGCACATTC 

TGAGGGGGGAACAATTCACACCACCAATAATAACCTGGTAAGATTTCAGGAGGTAAAGAAGGT 

GGAATAATTGACGGGGAGATAGCGCCTGAAATAAACAAAATATGGGCATGCATGCTAAAGGG 

AAAATGTGTGCAGGTCTACTGCATTAAATCCTGTGTGCTCCTCTITrGGATTTACAGAAATGTGT 

CAAATGTAAATCTTTCAAAGCCATTTAAAAATATTCACTTTAGTTCTCTGTGAAGAAGAGGAGA 

AAAGCAATCCTCCTGATTGTATTGTTTTAAACTTTAAGAATTTATCAAAATGCCGGTACTTAGG 

ACCTAAATITATCTATGTCTGTCATACGCTAAAATGATATrGGTCTITGAATTrGGTATACATTT 

ATTCTGTTCACTATCACAAAATCATCTATATTTATAGAGGAATAGAAGTTTATATATATATAATA 

CCATATTTTTAATTTCACAAATAAAAAATTCAAAGTT^ 

TGAAAATAATAGAGCTTGAGCTGTCTGAACTATTTTACATTTTATGGTGTCTCATAGCCAATCCC 
ACAGTGTAAAAATTCA 



Figure 31 

CGAGTAAAGTTTGCAAAGAGGCGCGGGAGGCGGCAGCCGCAGCGAGGAGGCGGCGGGGAAGA 

AGCGCAGTCTCCGGGTTGGGGGCGGGGGCGGGGGGGGCGCCAAGGAGCCGGGTGGGGGGCGG 

CGGCCAGCATGCGGCCCCGCAGCGCCCTGCCCCGCCTGCTGCTGCCGCTGCTGCTGCTGCCCGC 

CGCCGGGCCGGCCCAGTTCCACGGGGAGAAGGGCATCTCCATCCCGGACCACGGCTTCTGCCA 

GCCCATCTCCATCCCGCTGTGCACGGACATCGCCTACAACCAGACCATCATGCCCAACCTTCTG 

GGCCACACGAACCAGGAGGACGCAGGCCTAGAGGTGCACCAGTTCTATCCGCTGGTGAAGGTG 

CAGTGCTCGCCCGAACTGCGCTTCTTCCTGTGCTCCATGTACGCACCCGTGTGCACCGTGCTGG 

AACAGGCCATCCCGCCGTGCCGCTCTATCTGTGAGCGCGCGCGCCAGGGCTGCGAAGCCCTCAT 

GAACAAGTTCGGTTTTCAGTGGCCCGAGCGCCTGCGCTGCGAGCACTTCCCGCGCCACGGCGCC 

GAGCAGATCTGCGTCGGCCAGAACCACTCCGAGGACGGAGCTCCCGCGCTACTCACCACCGCG 

CCGCCGCCGGGACTGCAGCCGGGTGCCGGGGGCACCCCGGGTGGCCCGGGCGGCGGCGGCGCT 

CCCCCGCGCTACGCCACGCTGGAGCACCCCTTCCACTGCCCGCGCGTCCTCAAGGTGCCATCCT 

ATCTCAGCTACAAGTTTCTGGGCGAGCGTGATTGTGCTGCGCCCTGCGAACCTGCGCGGCCCGA 

TGGTTCCATGTTCITCTCACAGGAGGAGACGCGTTTCGCGCGCCTCTGGATCCTCACCTGGTCG 

GTGCTGTGCTGCGCTTCCACCTTCITCACTGTCACCACGTACTTGGTAGACATGCAGCGCTTCCG 

CTACCCAGAGCGGCCTATCATTTTTCTGTCGGGCTGCTACACCATGGTGTCGGTGGCCTACATC 

GCGGGCTTCGTGCTCCAGGAGCGCGTGGTGTGCAACGAGCGCTTCTCCGAGGACGGTTACCGC 

ACGGTGGTGCAGGGCACCAAGAAGGAGGGCTGCACCATCCTCTTCATGATGCTCTACTTCTTCA 

GCATGGCCAGCTCCATCTGGTGGGTCATCCTGTCGCTCACCTGGTTCCTGGCAGCCGGCATGAA 

GTGGGGCCACGAGGCCATCGAGGCCAACTCTCAGTACTTCCACCTGGCCGCCTGGGCCGTGCCG 

GCCGTCAAGACCATCACCATCCTGGCCATGGGCCAGATCGACGGCGACCTGCTGAGCGGCGTG 

TGCTTCGTAGGCCTCAACAGCCTGGACCCGCTGCGGGGCTTCGTGCTAGCGCCGCTCTTCGTGT 

ACCTGTTCATCGGCACGTCCTTCCTCCTGGCCGGCTTCGTGTCGCTCTTCCGCATCCGCACCATC 

ATGAAGCACGACGGCACCAAGACCGAAAAGCTGGAGCGGCTCATGGTGCGCATCGGCGTCTTC 

TCCGTGCTCTACACAGTGCCCGCCACCATCGTCATCGCTTGCTACTTCTACGAGCAGGCCTTCCG 

CGAGCACTGGGAGCGCTCGTGGGTGAGCCAGCACTGCAAGAGCCTGGCCATCCCGTGCCCGGC 

GCACTACACGCCGCGCATGTCGCCCGACTTCACGGTCTACATGATCAAATACCTCATGACGCTC 

ATCGTGGGCATCACGTCGGGCTTCTGGATCTGGTCGGGCAAGACGCTGCACTCGTGGAGGAAG 

TTCTACACTCGCCTCACCAACAGCCGACACGGTGAGACCACCGTGTGAGGGACGCCCCCAGGC 

CGGAACCGCGCGGCGCTTTCCrCCGCCCGGGGTGGGGCCCCTACAGACrCCGTATTTTAl llllT 

TAAATAAAAAACGATCGAAACCATTTCACTTTTAGGTTGCTTTTTAAAAGAGAACT 

AACACCCCC 

Figure 32 

GCCGCTCCGGGTACCTGAGGGACGCGCGGCCGCCCGCGGCAGGCGGTGCAGCCCCCCCCCACC 
CCTTGGAGCCAGGCGCCGGGGTCTGAGGATAGCATTTCTCAAGACCTGACTTATGGAGCACTTG 
TAACCTGAGATATTTCAGTTGAAGGAAGAAATAGCTCTTCTCCTAAGATGGAATCTGTGGTTTG 
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GGAATGTGGTTGATCAACTTGATATGTTGGCCAAATGTGCCCCATGTAATAAAATGAAAAGAA 

GAGACAAGATGATGTCATTTTCCCATATTGTGAAACCAAAAACAAACGCCTTTTGTGAGACCAA 

GCTAACAAACCTCTGACGGTGCGAAGAGTATTTAACTGTTTGAAGAATTTAACAGTAAGATACA 

GAAGAAGTACCTTCGAGCTGAGACCTGCAGGTGTATAAATATCTAAAATACATATTGAATAGG 

CCTGATCATCTGAATCTCCTTCAGACCCAGGAAGGATGGCTATGACITGGATTGTCrTCTCTCTT 

TGGCCCTTGACTGTGTTCATGGGGCATATAGGTGGGCACAGTTTGTTTTCTTGTGAACCTATTAC 

CTTGAGGATGTGCCAAGATTTGCCTTATAATACTACCTTCATGCCrAATCTTCTGAATCATT^ 

ACCAACAGACAGCAGCTTTGGCAATGGAGCCATTCCACCCTATGGTGAATCTGGATTGTTCTCG 

GGATITCCGGCCTTTTCTITGTGCACTCTACGCTCCTATTTGTATGGAATATGGACGTGTCACAC 

TTCCCTGTCGTAGGCTGTGTCAGCGGGCTTACAGTGAGTGTTCGAAGCTCATGGAGATGTTTGG 

TGTTCCTTGGCCTGAAGATATGGAATGCAGTAGGTTCCCAGATTGTGATGAGCCATATCCTCGA 

CTTGTGGATCTGAATTTAGCTGGAGAACCAACTGAAGGAGCCCCAGTGGCAGTGCAGAGAGAC 

TATGGTTTTTGGTGTCCCCGAGAGTrAAAAATTGATCCTGATCTGGGTTATTCTTTTCTGCATGT 

GCGTGATTGTTCACCTCCTTGTCCAAATATGTACTTCAGAAGAGAAGAACTGTCATTTGCTCGCT 

ATTTCATAGGATTGATTTCAATCATITGCCTCTCGGCCACATTGTTTACTTT^ 

TTGATGTCACAAGATTCCGTTATCCTGAAAGGCCTATTATATTTTATGCAGTCTGCTACATGATG 

GTATCCTTAATITrCTTCATTGGATTTTTGCTTGAAGATCGAGTAGCCTGCAATGCATCCATCCC 

TGCACAATATAAGGCTTCCACAGTGACACAAGGATCTCATAATAAAGCCTGTACCATGCTTTTT 

ATGATACTCTATTTTTTTACTATGGCTGGCAGTGTATGGTGGGTAATTCTTACCATCACATGGTT 

TTTAGCAGCTGTGCCAAAGTGGGGTAGTGAAGCTATTGAGAAGAAAGCATTGCTGTTTCACGCC 

AGTGCATGGGGCATCCCCGGAACTCTAACCATCATCCrTTTAGCGATGAATAAAATTGAAGGTG 

ACAATATTAGTGGCGTGTGTTTTGTTGGCCTCTACGATGT^ 

GCTCCCCTCTGCCTGTATGTGGTAGTTGGGGTTTCTCTCCTCTTAGCTGGCATTATATCCCTAAA 

CAGAGTTCGAATTGAGATTCCATTAGAAAAGGAGAACCAAGATAAATTAGTGAAGTTTATGAT 

CCGGATCGGTGTITrCAGCATTCTITATCTCGTACCACTCTTGGTTGTAATTGGATGCTACTTrTA 

TGAGCAAGCTTACCGGGGCATCTGGGAAACAACGTGGATACAAGAACGCTGCAGAGAATATCA 

CATTCCATGTCCATATCAGGTTACTCAAATGAGTCGTCCAGACnTGATTCrrCTTTCTGATGAAAT 

ACCTGATGGCTCTCATAGTTGGCATTCCCrCTGTATTTTGGGTTGGAAGCAAAAAGACATGCTTT 

GAATGGGCCAGTTTTTTTCATGGTCGTAGGAAAAAAGAGATAGTGAATGAGAGCCGACAGGTA 

CTCCAGGAACCTGATTTTGCTCAGTCTCTCCTGAGGGATCCAAATACTCCTATCATAAGAAAGT 

CAAGGGGAACTTCCACTCAAGGAACATCCACCCATGCTTCTTCAACTCAGCTGGCTATGGTGGA 

TGATCAAAGAAGCAAAGCAGGAAGCATCCACAGCAAAGTGAGCAGCTACCACGGCAGCCTCC 

ACAGATCACGTGATGGCAGGTACACGCCCTGCAGTTACAGAGGAATGGAGGAGAGACTACCTC 

ATGGCAGCATGTCACGACTAACAGATCACTCCAGGCATAGTAGTTCTCATCGGCTCAATGAACA 

GTCACGACATAGCAGCATCAGAGATCTCAGTAATAATCCCATGACTCATATCACACATGGCACC 

AGCATGAATCGGGTTATTGAAGAAGATGGAACCAGTGCTTAATTTGTCTTGTCTAAGGTGGAAA 

TCTTGTGCTGTTTAAAAAGCAGATTTTATTCTTTGCCTITrGCATGACTGATAGCTGTACT 

GTTAACATGCTTTCAGTCAAGTACAGATTGTGTCCACTGGAA^ 

TTGCATCAAACTrGGAACATCAAGGCATCCAAAACACTAAGAATTCTATCATCACAAAAATAAT 

TCGTCTITCTAGGTTATGAAGAGATAATTATTTGTCrGGTAAGCATTTTTATAAACCCACrCATT 

TTATATTTAGAAAAATCCTAAATGTGTGGTGACTGCTITGTAGTGAACrTTCATATACTATAAAC 

TAGTTGTGAGATAACATTCTGGTAGCTCAGTTAATAAAAC 

TATGCAAGGTTTACTTCTCAGATGAACAGTAGGACITrGTAGTTTTATTTCCACT 

AAGAACTGTGTTTTTAAACTGTAGGAGAATTTAATAAATCAGCAAGGGTATTTTAGCT 

ATAAAAGTGCAACAGAAGAATTrGATTAGTCTATGAAAGGTTCTCTTAAAATTCTATCGAAATA 

ATCTTCATGCAGAGATATTCAGGGTTTGGATTAGCAGTGGAATAAAGAGATGGGCATTGTTTCC 

CCTATAATTGTGCTGTTTTTATAACTTTTGTAAATATTACTTTTTCTGGCT 

ATCCATATGCATGATGGAAAAATmAATITGTAGCCATCITITCCCATGTAATAGTATTGATrC 

ATAGAGAACTTAATGTTCAAAATTTGCTTTGTGGAGGCATGTAATAAGATAAACATCATACATT 

ATAAGGTAACCACAATTACAAAATGGCAAAACA 

Figure 33 

GCTGCGCAGCGCTGGCTGCTGGCTGGCCTCGCGGAGACGCCGAACGGACGCGGCCGGCGCCGG 
CTTGTGGGCTCGCCGCCTGCAGCCATGACCCTCGCAGCCTGTCCCTCGGCCTCGGCCCGGGACG 
TCTAAAATCCCACACAGTCGCGCGCAGCTGCTGGAGAGCCGGCCGCTGCCCCCTCGTCGCCGCA 
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TCACACTCCCGTCCCGGGAGCTGGGAGCAGCGCGGGCAGCCGGCGCCCCCGTGCAAACTGGGG 

GTGTCTGCCAGAGCAGCCCCAGCCGCTGCCGCTGCTACCCCCGATGCTGGCCATGGCCTGGCGG 

GGCGCAGGGCCGAGCGTCCCGGGGGCGCCCGGGGGCGTCGGTCTCAGTCTGGGGTTGCTCCTG 

CAGTTGCTGCTGCTCCTGGGGCCGGCGCGGGGCTTCGGGGACGAGGAAGAGCGGCGCTGCGAC 

CCCATCCGCATCTCCATGTGCCAGAACCTCGGCTACAACGTGACCAAGATGCCCAACCTGGTTG 

GGCACGAGCTGCAGACGGACGCCGAGCTGCAGCTGACAACTTTCACACCGCTCATCCAGTACG 

GCTGCTCCAGCCAGCTGCAGTTCITCCTITGTTCTGtrTATGTGCCAATGTGCACAGAGAAGATC 

AACATCCCCATTGGCCCATGCGGCGGCATGTGTCTTTCAGTCAAGAGACGCTGTGAACCCGTCC 

TGAAGGAATTTGGATTTGCCTGGCCAGAGAGTCTGAACTGCAGCAAATTCCCACCACAGAACG 

ACCACAACCACATGTGCATGGAAGGGCCAGGTGATGAAGAGGTGCCCTTACCTCACAAAACCC 

CCATCCAGCCTGGGGAAGAGTGTCACTCTGTGGGAACCAATTCTGATCAGTACATCTGGGTGAA 

AAGGAGCCTGAACTGTGTGCTCAAGTGTGGCTATGATGCTGGCTTATACAGCCGCTCAGCCAAG 

GAGTTCACTGATATCTGGATGGCTGTGTGGGCCAGCCTGTGTTTCATCTCCACTGCCTTCACAGT 

ACTGACCTTCCTGATCGATTCnTCTAGGTTTTCCTACCCTGAGCGCCCCATCATATTTCTCAGTA 

TGTGCTATAATATrTATAGCATTGCTTATATTGTCAGGCTGACTGTAGGCCGGGAAAGGATATC 

CTGTGATTITGAAGAGGCAGCAGAACCTGTTCTCATCCAAGAAGGACTTAAGAACACAGGATG 

TGCAATAATTTTCTTGCTGATGTACTTTTTTGGAATGGCCAGCTCCATTTGGT 

CACTCACTTGGTTrTTGGCAGCAGGACTCAAATGGGGTCATGAAGCCATTGAAATGCACAGCTC 

TTATTTCCACATTGCAGCCTGGGCCATCCCCGCAGTGAAAACCATTGTCATCTTGATTATGAGA 

CTGGTGGATGCAGATGAACTGACTGGCTrGTGCTATGTTGGAAACCAAAATCTCGATGCCCTCA 

CCGGGTTCGTGGTGGCTCCCCTCTTTACTTATTTGGTCATTGGAACTTTGTTCATTGCTGCAGGT 

TTGGTGGCCTTGTTCAAAATTCGGTCAAATCTTCAAAAGGATGGGACAAAGACAGACAAGTTA 

GAAAGACTGATGGTCAAGATTGGGGTGTTCTCAGTACTGTACACAGTTCCTGCAACGTGTGTGA 

TTGCCTGTTATTTTTATGAAATCTCCAACTGGGCACTTTTTCGGTATTCTGCAGATGATTCCA^ 

ATGGCTGTTGAAATGTTGAAAACTTTTATGTCTTTGTTGGTGGGCATCACTTCAGGCATGTGGAT 

TTGGTCTGCCAAAAGTCTTCACACGTGGCAGAAGTGTTCCAACAGATTGGTGAATTCTGGAAAG 

GTAAAGAGAGAGAAGAGAGGAAATGGTTGGGTGAAGCCTGGAAAAGGCAGTGAGACTGTGGT 

ATAAGGCTAGTCAGCCTCCATGCTTTCTTCATTTTGAAGGGGGGAATGCCAGCATTTTGGAGGA 

AATTCTACTAAAAGTTTTATGCAGTGAATCTCAGTTTGAACAAACTAGCAACAATTAAGTGACC 

CCCGTCAACCCACTGCCTCCCACCCCGACCCCAGCATCAAAAAACCAATGATTTTGCTGCAGAC 

TITGGAATGATCCAAAATGGAAAAGCCAGTTAGAGGCTTTCAAAGCTGTGAAAAATCAAAACG 

TTGATCACTITAGCAGGTTGCAGCTTGGAGCGTGGAGGTCCTGCCTAGATTCCAGGAAGTCCAG 

GGCGATACTGTTTTCCCCTGCAGGGTGGGATTTGAGCTGTGAGTTGGTAACTAGCAGGGAGAAA 

TATTAACTTTTTTAACCCTTTACCATTTTAAATACTAACTGGGTCTTTC 

TATAAACACTGGAAACGCTGGGTTCAGAAAAGTGTTACAAGAGTTTTATAGTTTGGCTGATGTA 

ACATAAACATCTTCTGTGGTGCGCTGTCTGCrGTTTAGAACTTTGTGGACTGCACTCCCAAGAA 

GTGGTGTTAGAATCTTTCAGTGCCTTTGTCATAAAACAGTTATTTGAACAAACAAAAGTACTGT 

ACTCACACACATAAGGTATCCAGTGGATTTTTCTTCTCT 
CTOTCTTGGCTGCTGCTGTTTTCTTCATTTrATGTTAATGACTCAAA 

TITITGTACTGCAGCATGCTTAAAGAGGGGAAAAGGAAGGGTGATTCACTTTCTGACAATCACT 

TAATTCAGAGGAAAATGAGATTTACTAAGTTGACTTACCTGACGGACCCCAGAGACCTATTGCA 

TTGAGCAGTGGGGACTTAATATATTTTACTTGTGTGATTGCATCTATGCAGACGCCAGTCTGGA 

AGAGCTGAAATGTTAAGTTrCTTGGCAACTTTGCATrCACACAGATTAGCTGTGTAATTm 

TGTCAATTACAATTAAAAGCACATTGTTGGACCATGACATAGTATACTCAACTGACTTTAAAAC 

TATGGTCAACTTCAACTTGCATTCTCAGAATGATAGTGCCTrTAAAA 11 1 1 1 1 1 ATTTTI 1 AAAG 

CATAAGAATGTTATCAGAATCTGGTCTACTTAGGACAATGGAGACTTTTTCAGTTTTATAAAGG 

GAACTGAGGACAGCTAATCCAACTACTTGGTGCTGTAATTGTTTCCTAGTAATTGGCAAAGGCT 

CCTTGTAAGATTrCACTGGAGGCAGTGTGGCCTGGAGTATTTATATGGTGCTTAATGAATCTCC 

AGAATGCCAGCCAGAAGCCTGATTGGTTAGTAGGGAATAAAGTGTAGACCATATGAAATGAAC 

TGCAAACTCTAATAGCCCAGGTCTTAATTGCCTTTAGCAGAGGTATCCAAAGCTTTTAAAAT^ 

ATGCATACGTTCTTCACAAGGGGGTACCCCCAGCAGCCTCTCGAAAATTGCACTTCTCTTAAAA 

CTGTAACTGGCCTTTCTCTrACCTrGCCTTAGGCCTTCTAATCATGAGATCTTGGGGACAAATTG 

ACTATGTCACAGGTTGCTCTCCTTGTAACTCATACCTGTCTGCTTCAGCAACTG CTTTGCAATGA 

CATTTATTTATTAATTCATGCCnTAAAAAAATAGGAAGGGAAGClUllll'ITTTCTl"l'l'l nTTTI 
TrCAATCACACTTTGTGGAAAAACATTTCCAGGGACTCAAAATTCCAAAAAGGTGGTCAAATTC 

TGGAAGTAAGCATTTCCrCTTTTTTAAAAATTTGGTTTGAG 



WO 03/012082 PCT/GB02/03409 



28/41 



CCCmCTrCTTTCCTTITrGTTTTTGTGTGGTTCTTGAGCT 
GTCGATTGTATGTTTTGAAGGCAAAGTCTTC 

GCCCCTGCTGCTGTGCCCAGTCTGAGTACOTGGCTAGACTCTAGGTCAGGCTCCAGGAGCATG 

AGAATTGATCCCCAGAAGAACCATTTTAACTCCATCTGATACTCCATTGCCTATGAAATGTAAA 

ATGTGAACTCCCTGTGCTGCTTGTAGACAGTTCCCATAACTGTCCACGGCCCTGGAGCACGCAC 

CCAGGGGCAGAGCCTGCCCTTACTCACGCTCTGCTCTGGTGTCTTGGGAGTTGTGCAGGGACTC 

TGGCCCAGGCAGGGGAAGGAAGACCAGGCGGTAGGGGACTGGTCTTGCTGTTAGAGTATAGAG 

GTTTGTAATGCAGTTTTCTTCATAATGTGTCAGTGATTGTGTGACCAAGGCAGCATCTAGCAGA 

AAGCCAGGCATGGAGTAGGTGATCGATACTTGTCAATGACTAAATAATAACAATAAAAGAGCA 

CTTGGGTGAATCTGGGCACCTGATTTCTGAGTTTTGAGTTCTGGAGCTAGTGTTTTGACAATGCT 

TTGGGTTTTGACATGCCTTlTCCACAAATCrrCTrGCCTTTTCAGGGCAAA.GTGTATTTG 

AGTGGCCATTTGGATTAGTAGCCTTAGCAATGCTACAGGGTTATAGGCCCCTCTCCCTTTCACAT 

TCCAGACAATGGAGAGTGTITATGGTTTCAGGAAAAGAACTTTGTGGCTGAGGGGTCAGTTACC 

AGTGACClTCAATCAACTCCATCACTTCTTAAATCGGTATITGTrAAAAAAATCAGTTATTTrAT 

TTATTGAGTGCCGACTGTAGTAAAGCCCTGAAATAGATAATCTCTGTTCTTCTAACTGATCTAG 

GATGGGGACGCACCCAGGTCTGCTGAACTTTACTGTTCCTCTGGGAAAGGAGCAGGGACCTCTG 

GAATTCCCATCTGTTTCACTGTCTCCATTCCATAAATCTCTTCCTGTGTGAGCCACCACACCCAG 

CCTGGGTCTCTCTACTTTTAACACATCTCTCATCCCTTTCCCAGGACTTCCTTCCAAGTCAGTTAC 

AGGTGGTTTTAACAGAAAGCATCAGCTCTGCrTCGTGACAGTCTCTGGAGAAATCCCTTAGGAA 

GACTATGAGAGTAGGCCACAAGGACATGGGCCCACACATCTGCTTTGGCTTTGCCGGCAATTCA 

GGGCTTGGGGTATTCCATGTGACTTGTATAGGTATATTTGAGGACAGCATCTTGCTAGAGAAAA 

GGTGAGGGTTGTTTTTCTITCrCTGAAACCTACAGTAAATGGGTATGATrGTAGCTrCCTCAGAA 

ATCCCITGGCCTCCAGAGATTAAACATGGTGCAATGGCACCTCTGTCCAACCTCC'TTTCTGGTA 

GATTCCTTTCTCCTGCTTCATATAGGCCAAACCTCAGGGCAAGGGAACATGGGGGTAGAGTGGT 

GCTGGCCAGAACCATCTGCTTGAGCTACTTGGTTGATTCATATCCTCITTCCTTTATGGAGACCC 

ATTTCCTGATCTCTGAGACTGTTGCTGAACTGGCAACTTACTTGGGCCTGAAACTGGAGAAGGG 

GTGACATTTTTTTAATTTCAGAGATGCTTTCTGA 

CTCTCCAAACTCAGGTTCCGGGAAGCTTGTGTGTCTAGATACTGAATTGAGATTCTGTTCAGCA 

CCTTTTAGCTCTATACTCTCTGGCTCCCCTCATCCTCATGGTCACTGAATTAAATGCTTATTGTAT 

TGAGAACCAAGATGGGACCTGAGGACACAAAGATGAGCTCAACAGTCTCAGCCCTAGAGGAAT 

AGACTCAGGGATTTCACCAGGTCGGTGCAGTATTTGATTTCTGGTGAGGTGACCACAGCTGCAG 

TTAGGAAGGGAGCCATTGAGCACAGACTTTGGAAGGAACCTTTTTTTTGTTGTT^ 

TTGTTTGTTTGTTTGTTTGAGACAGGGTCTTGCTCTGCTACCCAGGCTGGGGCGCAATGGCACGA 

TCTTGGCTCACTGCAACCTCTGCCTCCTGGGTTCAAGTGATTCTCCTGCCACAGCCTCCTGAGGA 

GCTGGGACTACAGGTGCGTGCTACCACGCCCAGCTACTTCTGTATTTTTAGTAGAGACGGGGTT 

TCACTGTGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCATGATCTGCCCGCCTCAGCCTCCCAA 

AGTGCTGGGATTACAAGTGTGAGCCACCACACCTGGCCTGGAAGGAACCTCTTAAAATCAGTTT 

ACGTCTTGTATTTTGTTCTGTGATGGAGGACACTGGAGAGAGTTGCTATTCCAGTCAATCATGTC 

GAGTCACTGGACTCrGAAAATCCTATTGGTTCCTTTATrrTATTTGAGTTTAGAGTTCCCTTCTG 

GGTTTGTATTATGTCTGGCAAATGACCTGGGTTATCACTITTCCTCCAGGGTTAGATCATAGATC 

TTGGAAACTCCTTAGAGAGCATTTTGCTCCTACCAAGGATCAGATACTGGAGCCCCACATAATA 

GATITCATITCACTCTAGCCTACATAGAGCTTTCTGTTGCTGTCTCTTGCCATGCACTrGTGCGG 

TGATTACACACTTGACAGTACCAGGAGACAAATGACTTACAGATCCCCCGACATGCCTCTTCCC 

CTTGGCAAGCTCAGTTGCCCTGATAGTAGCATGTTTCTGTTTCrGATGTACCl'l i 1 1 rCTCTTCTT 

CTITGCATCAGCCAATTCCCAGAATTTCCCCAGGCAATTTGTAGAGGACCTTTTTGGGGTCCTAT 

ATGAGCCATGTCCTCAAAGCTTTTAAACCTCCTTGCTCTCCTACAATATTCAGTACATGACCACT 

GTCATCCTAGAAGGCTTCTGAAAAGAGGGGCAAGAGCCACTCTGCGCCACAAAGGTTGGATCC 

ATCTTCTCTCCGAGGTTGTGAAAGTTTTCAAATTGTACTAATAGGCTGGGGCCCTGACTTGGCTG 

TGGGCTTTGGGAGGGGTAAGCTGCTTTCTAGATCTCTCCCAGTGAGGCATGGAGGTGTTTCTGA 

ATrTTGTCTACCTCACAGGGATGTTGTGAGGCTTGAAAAGGTCAAAAAATGATGGCCCCTTGAG 

CTCTTrGTAAGAAAGGTAGATGAAATATCGGATGTAATCTGAAAAAAAGATAAAATGTGACTT 

CCCCTGCTCTGTGCAGCAGTCGGGCTGGATGCTCrGTGGCNTTTCTTGGGTCCTCATGCCACCCC 

ACAGCTCCAGGAACCTTGAAGCCAATCTGGGGACITrCAGATGTTTGACAAAGAGGTACCAGG 
CAAACTTCCTGCTACACATGCCCTGAATGAATTGCTAAATTTCAAAGGAAATGGACCCTGCTTT 
TAAGGATGTACAAAAGTATGTCrGCATCGATGTCTGTACTGTAAATTTCTAATTTATCACTGTAC 
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AAAGAAAACCCCTTGCTATTTAATTTTGTATTAAAGGAAAATAAAGTm 
AA 

Figure 34 

ACCCAGGGACGGAGGACCCAGGCTGGCTTGGGGACTGTCTGCTCTTCTCGGCGGGAGCCGTGG 

AGAGTCCTTTCCCTGGAATCCGAGCCCTAACCGTCTCTCCCCAGCCCTATCCGGCGAGGAGCGG 

AGCGCTGCCAGCGGAGGCAGCGCCTTCCCGAAGCAGTTTAT(nTrGGACGGTTTTCTTTAAAGG 

AAAAACGAACCAACAGGTTGCCAGCCCCGGCGCCACACACGAGACGCCGGAGGGAGAAGCCC 

CGGCCCGGATTCCTCTGCCTGTGTGCGTCCCTCGCGGGCTGCTGGAGGCGAGGGGAGGGAGGG 

GGCGATGGCTCGGCCTGACCCATCCGCGCCGCCCTCGCTGTTGCTGCTGCTCCTGGCGCAGCTG 

GTGGGCCGGGCGGCCGCCGCGTCCAAGGCCCCGGTGTGCCAGGAAATCACGGTGCCCATGTGC 

CGCGGCATCGGCTACAACCTGACGCACATGCCCAACCAGTTCAACCACGACACGCAGGACGAG 

GCGGGCCTGGAGGTGCACCAGTTCTGGCCGCTGGTGGAGATCCAATGCTCGCCGGACCTGCGCT 

TCTTCCTATGCACTATGTACACGCCCATCTGTCTGCCCGACTACCACAAGCCGCTGCCGCCCTGC 

CGCTCGGTGTGCGAGCGCGCCAAGGCCGGCTGCTCGCCGCTGATGCGCCAGTACGGCTTCGCCT 

GGCCCGAGCGCATGAGCTGCGACCGCCTCCCGGTGCTGGGCCGCGACGCCGAGGTCCTCTGCA 

TGGATTACAACCGCAGCGAGGCCACCACGGCGCCCCCCAGGCCTTTCCCAGCCAAGCCCACCCT 

TCCAGGCCCGCCAGGGGCGCCGGCCTCGGGGGGCGAATGCCCCGCTGGGGGCCCGTTCGTGTG 

CAAGTGTCGCGAGCCCTTCGTGCCCATTCTGAAGGAGTCACACCCGCTCTACAACAAGGTGCGG 

ACGGGCCAGGTGCCCAACTGCGCGGTACCCTGCTACCAGCCGTCCTTCAGTGCCGACGAGCGC 

ACGTTCGCCACCTTCTGGATAGGCCTGTGGTCGGTGCTGTGCTTCATCTCCACGTCCACCACAGT 

GGCCACCTTCCTCATCGACATGGACACGTTCCGCTATCCTGAGCGCCCCATCATCTTCCTGTCAG 

CCTGCTACCTGTGCGTGTCGCTGGGCTTCCTGGTGCGTCTGGTCGTGGGCCATGCCAGCGTGGC 

CTGCAGCCGCGAGCACAACCACATCCACTACGAGACCACGGGCCCTGCACTGTGCACCATCGT 

CTTCCTCCTGGTCTACTTCTTCGGCATGGCCAGCTCCATCTGGTGGGTCATCCTGTCGCTCACCT 

GGTTCCTGGCCGCCGCGATGAAGTGGGGCAACGAGGCCATCGCGGGCTACGGCCAGTACTTCC 

ACCTGGCTGCGTGGCTCATCCCCAGCGTCAAGTCCATCACGGCACTGGCGCTGAGCTCCGTGGA 

CGGGGACCCAGTGGCCGGCATCTGCTACGTGGGCAACCAGAACCTGAACTCGCTGCGGCGCTT 

CGTGCTGGGCCCGCTGGTGCTCTACCTGCTGGTGGGCACGCTCTTCCTGCTGGCGGGCTTCGTGT 

CGCTCTTCCGCATCCGCAGCGTCATCAAGCAGGGCGGCACCAAGACGGACAAGCTGGAGAAGC 

TCATGATCCGCATCGGCATCTTCACGCTGCTCTACACGGTCCCCGCCAGCATTGTGGTGGCCTG 

CTACCTGTACGAGCAGCACTACCGCGAGAGCTGGGAGGCGGCGCTCACCTGCGCCTGCCCGGG 

CCACGACACCGGCCAGCCGCGCGCCAAGCCCGAGTACTGGGTGCTCATGCTCAAGTACTTCATG 

TGCCTGGTGGTGGGCATCACGTCGGGCGTCTGGATCTGGTCGGGCAAGACGGTGGAGTCGTGG 

CGGCGTTTCACCAGCCGCTGCTGCTGCCGCCCGCGGCGCGGCCACAAGAGCGGGGGCGCCATG 

GCCGCAGGGGACTACCCCGAGGCGAGCGCCGCGCTCACAGGCAGGACCGGGCCGCCGGGCCCC 

GCCGCCACCTACCACAAGCAGGTGTCCCTGTCGCACGTGTAGGAGGCTGCCGCCGAGGGACTC 

GGCCGGAGAGCTGAGGGGAGGGGGGCGTTTTGTTTGGTAGTTTTGCCAAGGTCAOTCCGTTTA 

CCTTCATGGTGCTGTTGCCCCCTCCCGCGGCGACTTGGAGAGAGGGAAGAGGGGCGTTTTCGAG 

GAAGAACCTGTCCCAGGTCTTCrCCAAGGGGCCCAG 

CCTGCCTTCTTrATGGGAACCCrCTTTTTAATTTATATGTAT 



Figure 35 

GCAGCTCCAGTCCCGGACGCAACCCCGGAGCCGTCTCAGGTCCCTGGGGGGAACGGTGGGTTA 

GACGGGGACGGGAAGGGACAGCGGCCTTCGACCGCCCCCCGAGTAATTGACCCAGGACTCATT 

TTCAGGAAAGCCTGAAAATGAGTAAAATAGTGAAATGAGGAATTTGAACATTTTATCTTTGGAT 

GGGGATCTTCTGAGGATGCAAAGAGTGATrCATCCAAGCCATGTGGTAAAATCAGGAATTTGA 

AGAAAATGGAGATGTTTACATTITrGTrGACGTGTATTTTTCTACCCCTCCTAAGA^ 

CTCITCACCTGTGAACCAATTACTGTTCCCAGATGTATGAAAATGGCCTACAACATGACGi l 1 1 1 

CCCTAATCTGATGGGTCATTATGACCAGAGTATTGCCGCGGTGGAAATGGAGCATTITCTTCCT 

CTCGCAAATCTGGAATGTTCACCAAACATTGAAACTTTCCTCTGCAAAGCATTTGTACCAACCT 

GCATAGAACAAATTCATGTGGTTCCACCTTGTCGTAAACTITGTGAGAAAGTATATTCTGATTG 
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CAAAAAATTAATTGACACTTITGGGATCCGATGGCCTGAGGAGCTTGAATGTGACAGATTACAA 
TACTGTGATGAGACTGTTCCTGTAACTTTTGATCCACACACAGAATTTCTTGGTCCTCAGAAGA 
AAACAGAACAAGTCCAAAGAGACATTGGATTTTGGTGTCCAAGGCATCTTAAGACTTCTGGGG 
GACAAGGATATAAGTTTCTGGGAATTGACCAGTGTGCGCCTCCATGCCCCAACATGTATTTTAA 

AAGTGATGAGCTAGAGTTTGCAAAAAGTTTTATTGGAACAGTTTCAATATTTT 
ACTCTGTTCACATTCCTTAtmTTTTAATTGATGmGAAGATTCAGATACCCAGAGAGACCAAT 
TATATATTACTCTGTCTGTTACAGCATTGTATCTCTTATGTACTTCATTGGATTmGCTGGGCGA 
TAGCACAGCCTGCAATAAGGCAGATGAGAAGCTAGAACTTGGTGACACTGTTGTCCTAGGCTCT 

CAAAATAAGGCTTGCACCGTTTTGTTCATGCnTITGTATTTTTTCACAATC 

GTGGGTGATTCTTACCATTACITGGTTCTTAGCTGCAGGAAGAAAATGGAGTTGTGAAGCCATC 

GAGCAAAAAGCAGTGTGGTTTCATGCTGTTGCATGGGGAACACCAGGTTTCCTGACTGTTATGC 

TTCTrGCTCTGAACAAAGTTGAAGGAGACAACATTAGTGGAGTTTGCTTrGTTGGCCTITATGA 

CCTGGATGCTrCTCGCTACTTTGTACTCTTGCCACTGTGCCTTTGTGTGTTTGTTGGGCTCTCTCT 

TCTTITAGCTGGCATrATTTCCTTAAATCATGTrCGACAAGTCATACAACATGATGGCCGGAACC 

AAGAAAAACTAAAGAAATTTATGATTCGAATTGGAGTCTTCAGCGGCTTGTATCTTGTGCCATT 

AGTGACACITCTCGGATGTTACGTCTATGAGCAAGTGAACAGGATTACCTGGGAGATAACTTGG 

GTCTCTGATCATTGTCGTCAGTACCATATCCCATGTCCTTATCAGGCAAAAGCAAAAGCTCGAC 

CAGAArTGGCTTTATTrATGATAAAATACCTGATGACATTAATTGTTGGCATCTCTGCTGTCTTC 

TGGGTTGGAAGCAAAAAGACATGCACAGAATGGGCTGGGTITnTAAACGAAATCGCAAGAGA 

GATCCAATCAGTGAAAGTCGAAGAGTACTACAGGAATCATGTGAGTTTTTCTTAAAGCACAATT 

CTAAAGTTAAACACAAAAAGAAGCACTATAAACCAAGTTCACACAAGCTGAAGGTCATTTCCA 

AATCCATGGGAACCAGCACAGGAGCTACAGCAAATCATGGCACTTCTGCAGTAGCAATTACTA 

GCCATGATTACCTAGGACAAGAAACTTTGACAGAAATCCAAACCTCACCAGAAACATCAATGA 

GAGAGGTGAAAGCGGACGGAGCTAGCACCCCCAGGTTAAGAGAACAGGACTGTGGTGAACCT 

GCCTCGCCAGCAGCATCCATCTCCAGACTCTCTGGGGAACAGGTCGACGGGAAGGGCCAGGCA 

GGCAGTGTATCTGAAAGTGCGCGGAGTGAAGGAAGGATTAGTCCAAAGAGTGATATTACTGAC 

ACTGGCCTGGCACAGAGCAACAATTTGCAGGTCCCCAGTTCTTCAGAACCAAGCAGCCTCAAA 

GGTrCCACATCTCTGCTTGTTCACCCAGTTTCAGGAGTGAGAAAAGAGCAGGGAGGTGGTTGTC 

ATTCAGATACTTGAAGAACATTTTCTCTCGTTACTCAGAAGCAAATTTGTGTTACACTGGAAGT 

GACCTATGCACTGTTTTGTAAGAATCACTGTTACGTTCTTCTTTTGCACTTAAAGTTGCATTGCC 

TACTGTTATACTGGAAAAAATAGAGTTCAAGAATAATATGACTCATTTCACACAAAGGTTAATG 

ACAACAATATAGCTGAAAACAGAAATGTGCAGGTTAATAATATTTTTTTAATAGTGTGGGAGGA 

CAGAGTTAGAGGAATCTTCCITITCTATTTATGAAGATTCT^ 

TGTACTATGCTATTTTACCTTTTTGATATAAAATCAAGATATITCTTTGCrGA 

TATCCnTGTATCTTTTTATACATATTTGAAAATAAGCTTATATGTATTTGAACTTT^ 

TATTCAAGTATTTTTATCATGCTATTGTGATATC 

TAAGAAAATTGTAAAATAGTCTTCTTTrATACTGTAAAAAAAGATATACCAAAAAGTCITATAA 
TAGGAATTTAACTTTAAAAACCCACTTATTGATACCTTACCATCTAAAATGTGTGATTm 
TCTCGTTTTAGGAATTTCACAGATCTAAATTATGTAACTGAAATAAGGTGCTTACTCAAAGAGT 
GTCCACTATTGATTGTATrATGCrGCrCACTGATCCTrCTGCATATTTAAAATAAAATGTCCTAA 
AGGGTTAGTAGACAAAATGTTAGTCTTTTGTATATTAGGCCAAGTGCAATTG ACTTCC Cl 1 1 1 1 1 
AATGTTTCATGACCACCCATTGATTGTATTATAACCACTTACAGTTGCTTATAll llllGTTTTAA 

CTTITGTTTCTTAACATTTAGAATA^ 
AG 



Figure 36 

CTCTCCCAACCGCCTCGTCGCACTCCTCAGGCTGAGAGCACCGCTGCACTCGCGGCCGGCGATG 
CGGGACCCCGGCGCGGCCGCTCCGCTTTCGTCCCTGGGCCTCTGTGCCCTGGTGCTGGCGCTGC 
TGGGCGCACTGTCCGCGGGCGCCGGGGCGCAGCCGTACCACGGAGAGAAGGGCATCTCCGTGC 
CGGACCACGGCTTCTGCCAGCCCATCTCCATCCCGCTGTGCACGGACATCGCCTACAACCAGAC 
CATCCTGCCCAACCTGCTGGGCCACACGAACCAAGAGGACGCGGGCCTCGAGGTGCACCAGTT 
CTACCCGCTGGTGAAGGTGCAGTGTTCTCCCGAACTCCGCTTTTTCTTATGCTCCATGTATGCGC 
CCGTGTGCACCGTGCTCGATCAGGCCATCCCGCCGTGTCGTTCTCTGTGCGAGCGCGCCCGCCA 
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GGGCTGCGAGGCGCTCATGAACAAGTTCGGCTTCCAGTGGCCCGAGCGGCTGCGCTGCGAGAA 

CTTCCCGGTGCACGGTGCGGGCGAGATCTGCGTGGGCCAGAACACGTCGGACGGCTCCGGGGG 

CCCAGGCGGCGGCCCCACTGCCTACCCTACCGCGCCCTACCTGCCGGACCTGCCCTTCACCGCG 

CTGCCCCCGGGGGCCTCAGATGGCAGGGGGCGTCCCGCCTTCCCCTTCTCATGCCCCCGTCAGC 

TCAAGGTGCCCCCGTACCTGGGCTACCGCTTCCTGGGTGAGCGCGATTGTGGCGCCCCGTGCGA 

ACCGGGCCGTGCCAACGGCCTGATGTACTTTAAGGAGGAGGAGAGGCGCTTCGCCCGCCTCTG 

GGTGGGCGTGTGGTCCGTGCTGTGCTGCGCCrCGACGCTCTTTACCGTTCTCACCTACCTGGTGG 

ACATGCGGCGCTTCAGCTACCCAGAGCGGCCCATCATCTTCCTGTCGGGCTGCTACTTCATGGT 

GGCCGTGGCGCACGTGGCCGGCITCCTTCTAGAGGACCGCGCCGTGTGCGTGGAGCGCTTCTCG 

GACGATGGCTACCGCACGGTGGCGCAGGGCACCAAGAAGGAGGGCTGCACCATCCTCTTCATG 

GTGCTCTACTrCTTCGGCATGGCCAGCTCCATCTGGTGGGTCATTCTGTCrCTCACTTGGTTCCT 

GGCGGCCGGCATGAAGTGGGGCCACGAGGCCATCGAGGCCAACTCGCAGTACTTCCACCTGGC 

CGCGTGGGCCGTGCCCGCCGTCAAGACCATCACTATCCTGGCCATGGGCCAGGTAGACGGGGA 

CCTGCTGAGCGGGGTGTGCTACGTTGGCCTCTCCAGTGTGGACGCGCTGCGGGGCTTCGTGCTG 

GCGCCTCTGTrCGTCTACCTCTTCATAGGCACGTCCTTCTTGCTGGCCGGCTrCGTGTCCCTCTTC 

eGTATCCGCACCATCATGAAACACGACGGCACCAAGACCGAGAAGCTGGAGAAGCTCATGGTG 

CGCATCGGCGTCTTCAGCGTGCTCTACACAGTGCCCGCCACCATCGTCCTGGCCTGCTACTTCTA 

CGAGCAGGCCTTCCGCGAGCACTGGGAGCGCACCTGGCTCCTGCAGACGTGCAAGAGCTATGC 

CGTGCCCTGCCCGCCCGGCCACTTCCCGCCCATGAGCCCCGACTTCACCGTCTTCATGATCAAG 

TACCTGATGACCATGATCGTCGGCATCACCACrGGCTTCTGGATCTGGTCGGGCAAGACCCTGC 

AGTCGTGGCGCCGCTTCTACCACAGACTTAGCCACAGCAGCAAGGGGGAGACTGCGGTATGAG 

CCCCGGCCCCTCCCCACCTTTCCCACCCCAGCCCTCTTGCAAGAGGAGAGGCACGGTAGGGAAA 

AGAACTGCTGGGTGGGGGCCTGTTTCTGTAACTTTCTCCCCCTCTACTGAGAAGTGACCTGGAA 

GTGAGAAGTTCTTTGCAGATTTGGGGCGAGGGGTGATTTGGAAAAGAAGACCTGGGTGGAAAG 

CGGTTTGGATGAAAAGATTTCAGGCAAAGACTTGCAGGAAGATGATGATAACGGCGATGTGAA 

TCGTCAAAGGTACGGGCCAGCTTGTGCCTAATAGAAGGTTGAGACCAGCAGAGACTGCTGTGA 

GTTTCTCCCGGCTCCGAGGCTGAACGGGGACTGTGAGCGATCCCCCTGCTGCAGGGCGAGTGGC 

CTGTCCAGACCCCTGTGAGGCCCCGGGAAAGGTACAGCCCTGTCTGCGGTGGCTGCTTTGTTGG 

AAAGAGGGAGGGCCTCCTGCGGTGTGCTTGTCAAGCAGTGGTCAAACCATAATCTCTTTTCACT 

GGGGCCAAACTGGAGCCCAGATGGGTTAATTTCCAGGGTCAGACATTACGGTCTCTCCTCCCCT 

GCCCCCTCCCGCCTGTTITrCCrCCCGTACTGCTTTCAGGTCTTGTAAAATAAGCATTTGG AAGT 

CTTGGGAGGCCTGCCTGCTAGAATCCTAATGTGAGGATGCAAAAGAAATGATGATAACATTTTG 

AGATAAGGCCAAGGAGACGTGGAGTAGGTATTTTTGCTACTTTTTCATTITCTGGGGAAGGCAG 

GAGGCAGAAAGACGGGTGTTTTATTrGGTCTAATACCCTGAAAAGAAGTGATGACTTGTTGCTT 

TTCAAAACAGGAATGCATTTTTCCCCITGTCTITGTTGTAAGAGACAAAAGAGGA^ 

GTCTCCCTGTGGAAAGGCATAACTGTGACGAAAGCAACTTTTATAGGCAAAGCAGCGCAAATC 

TGAGGTTTCCCGTTGGTTGTTAATTTGGTTGAGATA 

GCAGTGTGCTGTCACACACCGTTAAGCCAGAGGTTCTGACTTCGCTAAAGGAAATGTAAGAGG 

TTTTGTTGTCTGTTTTAAATAAATTTAATTCGGAACACATGATCCAACAGACT 

TCAGGGAAATCTCTCCOTCATTTACITITrCTTGCrATAAGCC^^ 

TTTTITCTCCCATTTGGATCCTTTGAGGTAAAAAAACATAATGTCIT 

AAGTTAATTAAAAAAAAAAAGCAAAGAGCCATTTTGTCCTGTTTTCTTGGTTCCATC 

TTATTAAACATCATCCATATGCTGACCCTGTCTCTGTGTGGTTGGGTTGGGAGGCGATCAGCAG 

ATACCATAGTGAACGAAGAGGAAGGTTTGAACCATGGGCCCCATCTTTAAAGAAAGTCATTAA 

AAGAAGGTAAACHrrCAAAGTGATTCTGGAGTTCTTTGAAATC 

AATCTTAAATCATGTACTTTTTTTCTGTAAT^ 

TTAGCAGAGAATCATGGGAGCTAACCTITATCCCACCTTTGACACTACCCTCCAATCTTGCAAC 

ACTATCCTGTTTCTCAGAACAGTTTTTAAATGCCAATCATAGAGGGTACTGTAAAGTGTACAAG 

TTACnTATATATGTAATGTTCACITGAGTGGAACTGCTTTTTACATTAAAG^ 

TGTGTTTCTTCAACCTTCAAAACTATCrCATCTGTCAGATTTTTAAAACTCCAACACAG 

GCATCTTTTGTGCTGTATCTTTTAAGTGCATGTGAAATTTGTAAAATAGAGAT AAGTA CAGTAT 

GTATATTTTGTAAATCTCCCATTTTTGTAAGAAAATATATATTGTATTTATACAIl'l'l'IA 

ATTTTTGTTTTGTTGGCTTTAAAGGTCTACCCCACTTTATCACATGTACAG^ 

TTTTTAAATAC 
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ACAGCATGGAGTGGGGTTACCTGTrGGAAGTGACCTCGCTGCTGGCCGCCTTGGCGCTGCTGCA 

GCGCTCTAGCGGCGCTGCGGCCGCCTCGGCCAAGGAGCTGGCATGCCAAGAGATCACCGTGCC 

GCTGTGTAAGGGCATCGGCTACAACTACACCTACATGCCCAATCAGTTCAACCACGACACGCA 

AGACGAGGCGGGCCTGGAGGTGCACCAGTTCTGGCCGCTGGTGGAGATCCAGTGCTCGCCCGA 

TCTCAAGTTCTTCCTGTGCAGCATGTACACGCCCATCTGCCTAGAGGACTACAAGAAGCCGCTG 

CCGCCCTGCCGCTCGGTGTGCGAGCGCGCCAAGGCCGGCTGCGCGCCGCTCATGCGCCAGTAC 

GGCTTCGCCTGGCCCGACCGCATGCGCTGCGACCGGCTGCCCGAGCAAGGCAACCCTGACACG 

CTGTGCATGGACTACAACCGCACCGACCTAACCACCGCCGCGCCCAGCCCGCCGCGCCGCCTGC 

CGCCGCCGCCGCCCGGCGAGCAGCCGCCTTCGGGCAGCGGCCACGGCCGCCCGCCGGGGGCCA 

GGCCCCCGCACCGCGGAGGCGGCAGGGGCGGTGGCGGCGGGGACGCGGCGGCGCCCCCAGCT 

CGCGGCGGCGGCGGTGGCGGGAAGGCGCGGCCCCCTGGCGGCGGCGCGGCTCCCTGCGAGCCC 

GGGTGCCAGTGCCGCGCGCCTATGGTGAGCGTGTCCAGCGAGCGCCACCCGCTCTACAACCGC 

GTCAAGACAGGCCAGATCGCTAACTGCGCGCTGCCCTGCCACAACCCCTTTTTCAGCCAGGACG 

AGCGCGCCTTCACCGTCTTCTGGATCGGCCTGTGGTCGGTGCTCTGCTTCGTGTCCACCTTCGCC 

ACCGTCTCCACCTTCCTTATCGACATGGAGCGCTTCAAGTACCCGGAGCGGCCCATTATCTTCCT 

CTCGGCCTGCTACCTCTTCGTGTCGGTGGGCTACCTAGTGCGCCTGGTGGCGGGCCACGAGAAG 

GTGGCGTGCAGCGGTGGCGCGCCGGGCGCGGGGGGCGCTGGGGGCGCGGGCGGCGCGGCGGC 

GGGCGCGGGCGCGGCGGGCGCGGGCGCGGGCGGCCCGGGCGGGCGCGGCGAGTACGAGGAGC 

TGGGCGCGGTGGAGCAGCACGTGCGCTACGAGACCACCGGCCCCGCGCTGTGCACCGTGGTCT 

TCITGCTGGTCTACTTCTTCGGCATGGCCAGCTCCATCTGGTGGGTGATCTTGTCGCTCACATGG 

TTCCTGGCGGCCGGTATGAAGTGGGGCAACGAAGCCATCGCCGGCTACTCGCAGTACTTCCACC 

TGGCCGCGTGGCTTGTGCCCAGCGTCAAGTCCATCGCGGTGCTGGCGCTCAGCTCGGTGGACGG 

CGACCCGGTGGCGGGCATCTGCTACGTGGGCAACCAGAGCCTGGACAACCTGCGCGGCTTCGT 

GCTGGCGCCGCTGGTCATCTACCTCTTCATCGGCACCATGTTCCTGCTGGCCGGCTTCGTGTCCC 

TGTTCCGCATCCGCTCGGTCATCAAGCAACAGGACGGCCCCACCAAGACGCACAAGCTGGAGA 

AGCTGATGATCCGCCTGGGCCTGTTCACCGTGGTCTACACCGTGCCCGCCGCGGTGGTGGTCGC 

CTGCCTCTTCTACGAGCAGCACAACCGCCCGCGCTGGGAGGCCACGCACAACTGCCCGTGCCTG 

CGGGACCTGCAGCCCGACCAGGCACGCAGGCCCGACTACGCCGTCTTCATGCTCAAGTACTTCA 

TGTGCCTAGTGGTGGGCATCACCTCGGGCGTGTGGGTCTGGTCCGGCAAGACGCTGGAGTCCTG 

GCGCTCCCTGTGCACCCGCTGCTGCTGGGCCAGCAAGGGCGCCGCGGTGGGCGGGGGCGCGGG 

CGCCACGGCCGCGGGGGGTGGCGGCGGGCCGGGGGGCGGCGGCGGCGGGGGACCCGGCGGCG 

GCGGGGGGCCGGGCGGCGGCGGGGGCTCCCTCTACAGCGACGTCAGCACTGGCCTGACGTGGC 

GGTCGGGCACGGCGAGCTCCGTGTCTTATCCAAAGCAGATGCCATTGTCCCAGGTCTGAGCGGA 

GGGGAGGGGGCGCCCAGGAGGGGTGGGGAGGGGGGCGAGGAGACCCAAGTGCAGCGAAGGG 

ACACITGATGGGCTGAGGTTCCCACCCCTTCACAGTGTTGATTGCTATTAGCATGATAATGAAC 

TCTrAATGGTATCCATTAGCTGGGACTTAAATGACrCACTTAGAACAAAGTACCTGGCATTGAA 

GCCTCCCAGACCCAGCCCCTTTTCCrCCATTGATGTGCGGGGAGCTCCTCCCGCCACGCGTTAAT 

TTCTGTTGGCTGAGGAGGGTGGACTCTGCGGCGTTTCCAGAACCCGAGATTTGGAGCCCTCCCT 

GGCTGCACTTGGCTGGGTTTGCAGTCAGATACACAGATTTCACCTGGGAGAACCTCTrTTTCTCC 

CTCGACTCTTCCTACGTAAACTCCCACCCCTGACTTACCCTGGAGGAGGGGTGACCGCCACCTG 

ATGGGATTGCACGGTTTGGGTATrCTTAATGACCAGGCAAATGCCTTAAGTAAACAAACAAGA 

AATGTCTTAATTATACACCCCACGTAAATACGGGTTTOTACATTAGAGGATGTATTTATATAAT 

TATTTGTTAAATTGTAAAAAAAAAAAGTGTAAAATATGTATATATCCAAAGATATAGTGTGTAC 

ATTTTTTTGTAAAAAGTTTAGAGGCTrACCCCTGTAAGAACAGATATA^ 

ATAAAATGACTTTTGATAAATGATTrAACCATTGCCCTCTCCCCCGCCTCTTCTGAGCTGTCACC 
TTTAAAGTGCTTGCTAAGGACGCATGGGGAAAATGGACATTITCTGGCTTGTCATTCTGTACAC 
TGACCTTAGGCATGGAGAAAATTACTrGTTAAACTCTAGTTCTTAAGTTGTTAGCCAAGTAAAT 
ATCATTGTTGAACTGAAATCAAAATTGAGTTTTTG 

GAGCTCnTITCTGATCCATGGATAACAACrCrrCACTTTAGTGGATGTAAATGGAA 
GGCAGTAATTCCCCTTAGGCCTTGTTATITATCCrGCATGGTATCACTAAAGGTTTCAAAACCCT 

GAAAAAAAA 
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CCGCCTTCGGCCCGGGCCTCCCGGGATGGCCGTGGCGCCTCTGCGGGGGGCGCTGCTGCTGTGG 

CAGCTGCTGGCGGCGGGCGGCGCGGCACTGGAGATCGGCCGCTTCGACCCGGAGCGCGGGCGC 

GGGGCTGCGCCGTGCCAGGCGGTGGAGATCCCCATGTGCCGCGGCATCGGCTACAACCTGACC 

CGCATGCCCAACCTGCTGGGCCACACGTCGCAGGGCGAGGCGGCTGCCGAGCTAGCGGAGTTC 

GCGCCGCTGGTGCAGTACGGCTGCCACAGCCACCTGCGCTTCTTCCTGTGCTCGCTCTACGCGC 

CCATGTGCACCGACCAGGTCTCGACGCCCATTCCCGCCTGCCGGCCCATGTGCGAGCAGGCGCG 

CCTGCGCTGCGCGCCCATCATGGAGCAGTTCAACTTCGGCTGGCCGGACTCGCTCGACTGCGCC 

CGGCTGCCCACGCGCAACGACCCGCACGCGCTGTGCATGGAGGCGCCCGAGAACGCCACGGCC 

GGCCCCGCGGAGCCCCACAAGGGCCTGGGCATGCTGCCCGTGGCGCCGCGGCCCGCGCGCCCT 

CCCGGAGACCTGGGCCCGGGCGCGGGCGGCAGTGGCACCTGCGAGAACCCCGAGAAGTTCCAG 

TACGTGGAGAAGAGCCGCTCGTGCGCACCGCGCTGCGGGCCCGGCGTCGAGGTGTTCTGGTCC 

CGGCGCGACAAGGACTTCGCGCTGGTCTGGATGGCCGTGTGGTCGGCGCTGTGCTTCTTCTCCA 

CCGCCTTCACTGTGCTCACCTTCTTGCTGGAGCCCCACCGCTTCCAGTACCCCGAGCGCCCCATC 

ATCTTCCTCTCCATGTGCTACAACGTCTACTCGCTGGCCTTCCTGATCCGTGCGGTGGCCGGAGC 

GCAGAGCGTGGCCTGTGACCAGGAGGCGGGCGCGCTCTACGTGATCCAGGAGGGCCTGGAGAA 

CACGGGCTGCACGCTGGTCTTCCTACTGCTCTACTACTTCGGCATGGCCAGCTCGCTCTGGTGG 

GTGGTCCTGACGCTCACCTGGTTCCTGGCTGCCGGGAAGAAATGGGGCCACGAGGCCATCGAG 

GCCCACGGCAGCTATTTCCACATGGCTGCCTGGGGCCTGCCCGCGCTCAAGACCATCGTCATCC 

TGACCCTGCGCAAGGTGGCGGGTGATGAGCTGACTGGGCTTTGCTACGTGGCCAGCACGGATG 

CAGCAGCGCTCACGGGCTTCGTGCTGGTGCCCCTCTCTGGCTACCTGGTGCTGGGCAGTAGTTT 

CCTCCTGACCGGCTTCGTGGCCCTCTTCCACATCCGCAAGATCATGAAGACGGGCGGCACCAAC 

ACAGAGAAGCTGGAGAAGCTCATGGTCAAGATCGGGGTCTTCTCCATCCTCTACACGGTGCCCG 

CCACCTGCGTCATCGTTTGCTATGTCTACGAACGCCTCAACATGGACTTCTGGCGCCTTCGGGCC 

ACAGAGCAGCCATGCGCAGCGGCCGCGGGGCCCGGAGGCCGGAGGGACTGCTCGCTGCCAGG 

GGGCTCGGTGCCCACCGTGGCGGTCTTCATGCTCAAAATTTTCATGTCACTGGTGGTGGGGATC 

ACCAGCGGCGTCTGGGTGTGGAGCTCCAAGACTTTCCAGACCTGGCAGAGCCTGTGCTACCGCA 

AGATAGCAGCTGGCCGGGCCCGGGCCAAGGCCTGCCGCGCCCCCGGGAGCTACGGACGTGGCA 

CGCACTGCCACTATAAGGCTCCCACCGTGGTCTTGCACATGACTAAGACGGACCCCTCnTGGA 

GAACCCCACACACCTCTAGCCACACAGGCCTGGCGCGGGGTGGCTGCTGCCCCCTCCTTGCCCT 

CCACGCCCTGCCCCCTGCATCCCCTAGAGACAGCTGACTAGCAGCTGCCCAGCTGTCAAGGTCA 

GGCAAGTGAGCACCGGGGACTGAGGATCAGGGCGGGACCCCGTGAGGCTCATTAGGGGAGAT 

GGGGGTCTCCCCTAATGCGGGGGCTGGACCAGGCTGAGTCCCCACAGGGTCCTAGTGGAGGAT 

GTGGAGGGGCGGGGCAGAGGGGTCCAGCCGGAGTTTATTTAATGATGTAATTTATTGTTGCGTT 

CCTCTGGAAGCTGTGACTGGAATAAACCCCCGCGTGGCACTGCTGATCCTCTCTGGCTGGGAAG 

GGGGAAGGTAGGAGGTGAGGC 



Figure 39 

ACACGTCCAACGCCAGCATGCAGCGCCCGGGCCCCCGCCTGTGGCTGGTCCTGCAGGTGATGG 

GCTCGTGCGCCGCCATCAGCTCCATGGACATGGAGCGCCCGGGCGACGGCAAATGCCAGCCCA 

TCGAGATCCCGATGTGCAAGGACATCGGCTACAACATGACTCGTATGCCCAACCTGATGGGCC 

ACGAGAACCAGCGCGAGGCAGCCATCCAGTTGCACGAGTTCGCGCCGCTGGTGGAGTACGGCT 

GCCACGGCCACCTCCGCTTCTTCCTGTGCTCGCTGTACGCGCCGATGTGCACCGAGCAGGTCTC 

TACCCCCATCCCCGCCTGCCGGGTCATGTGCGAGCAGGCCCGGCTCAAGTGCTCCCCGATTATG 

GAGCAGTTCAACTTCAAGTGGCCCGACTCCCTGGACTGCCGGAAACTCCCCAACAAGAACGAC 

CCCAACTACCTGTGCATGGAGGCGCCCAACAACGGCTCGGACGAGCCCACCCGGGGCTCGGGC 

CTGTTCCCGCCGCTGTTCCGGCGGCAGCGGCCCCACAGCGCGCAGGAGCACCCGCTGAAGGAC 

GGGGGCCCCGGGCGCGGCGGCTGCGACAACCCGGGCAAGTTCCACCACGTGGAGAAGAGCGC 

GTCGTGCGCGCCGCTCTGCACGCCCGGCGTGGACGTGTACTGGAGCCGCGAGGACAAGCGCTT 

CGCAGTGGTCTGGCTGGCCATCTGGGCGGTGCTGTGCTTCTTCTCCAGCGCCTTCACCGTGCTCA 

CCTTCCTCATCGACCCGGCCCGCTTCCGCTACCCCGAGCGCCCCATCATCTTCCTCTCCATGTGC 

TACTGCGTCTACTCCGTGGGCTACCTCATCCGCCTCTTCGCCGGCGCCGAGAGCATCGCCTGCG 

ACCGGGACAGCGGCCAGCTCTATGTCATCCAGGAGGGACTGGAGAGCACCGGCTGCACGCTGG 

TCTTCCTGGTCCTCTACTACTTCGGCATGGCCAGCTCGCTGTGGTGGGTGGTCCTCACGCTCACC 
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TGGTTCCTGGCCGCCGGCAAGAAGTGGGGCCACGAGGCCATCGAAGCCAACAGCAGCTACTTC 

CACCTGGCAGCCTGGGCCATCCCGGCGGTGAAGACCATCCTGATCCTGGTCATGCGCAGGGTG 

GCGGGGGACGAGCTCACCGGGGTCTGCTACGTGGGCAGCATGGACGTCAACGCGCTCACCGGC 

TTCGTGCTCATTCCCCTGGCCTGCTACCTGGTCATCGGCACGTCCTTCATCCTCTCGGGCTTCGT 

GGCCCTGTTCCACATCCGGAGGGTGATGAAGACGGGCGGCGAGAACACGGACAAGCTGGAGA 

AGCTCATGGTGCGTATCGGGCTCTTCTCTGTGCTGTACACCGTGCCGGCCACCTGTGTGATCGCC 

TGCTACTTTTACGAACGCCTCAACATGGATTACTGGAAGATCCTGGCGGCGCAGCACAAGTGCA 

AAATGAACAACCAGACTAAAACGCTGGACTGCCTGATGGCCGCCTCCATCCCCGCCGTGGAGA 

TCTrCATGGTGAAGATCTTTATGCTGCTGGTGGTGGGGATCACCAGCGGGATGTGGATTTGGAC 

CTCCAAGACTCTGCAGTCCTGGCAGCAGGTGTGCAGCCGTAGGTTAAAGAAGAAGAGCCGGAG 

AAAACCGGCCAGCGTGATCACCAGCGGTGGGATTTACAAAAAAGCCCAGCATCCCCAGAAAAC 

TCACCACGGGAAATATGAGATCCCTGCCCAGTCGCCCACCTGCGTGTGAACAGGGCTGGAGGG 

AAGGGCACAGGGGCGCCCGGAGCTAAGATGTGGTGCTTTTCTrGGTTGTGTTTITCTTTOT 

CTTCTTT1 , 1T1 , 1 , 1T1 1 1 1 ATAAAAGC AAAAGAGAAATACATAAAAAAGTGTT T ACCC TG AAATTC 

AGGATGCTGTGATACACTGAAAGGAAAAATCTACTTAAAGGGTTTC^ 

AGCGAAGGGAAGCTCCTCCAGTGAAGTAGCCTCTTGTGTAACTAATTTGTGGTAAAGTAGTTGA 

TTCAGCCCTCAGAAGAAAACTTTTGTTTAGAGCCCTCCGTAAATATACATCTGTGTATTTGAGTT 

GGCTITGCTACCCATrTACAAATAAGAGGACAGATAACTGCTTTGCAAATTCAAGAGCCTCCCC 

TGGGTTAACAAATGAGCCATCCCCAGGGCCCACCCCCAGGAAGGCCACAGTGCTGGGCGGCAT 

CCCTGCAGAGGAAAGACAGGACCCGGGGCCCGCCTCACACCCCAGTGGATTTGGAGTTGCTTA 

AAATAGACTCTGGCCTTCACCAATAGTCTCTCTGCAAGACAGAAACCTCCATCAAACCTCACAT 

TTGTGAACTCAAACGATGTGCAATACATTTTTTTCT^ 

GTATTTrGCTATATATAAAGACAACAAAAGAAATCTCCTAACAAAAGAACTAAGAGGCCCAGC 

CCTCAGAAACCCTTCAGTGCTACATTTTGTGGCTTTTTAATGGAAACCAAGCCAATGTTAT^^ 

CGTTTGGACTGATTTGTGGAAAGGAGGGGGGAAGAGGGAGAAGGATCATTCAAAAGTTACCCA 

AAGGGCTTATTGACTCTTTCTATTGTTAAACAAATGATTTCCACAAACAGATCAGGAAGCACTA 

GGTTGGCAGAGACACTTrGTCTAGTGTATTCTCTTCACAGTGCCAGGAAAGAGTGGTTTCTGCG 

TGTGTATATTTGTAATATATGATATTTTTCATC 

TTAAAAAAA 

Figure 40 

CCTGCAGCCTCCGGAGTCAGTGCCGCGCGCCCGCCGCCCCGCGCCTTCCTGCTCGCCGCACCTC 

CGGGAGCCGGGGCGCACCCAGCCCGCAGCGCCGCCTCCCCGCCCGCGCCGCCTCCGACCGCAG 

GCCGAGGGCCGCCACTGGCCGGGGGGACCGGGCAGCAGCTTGCGGCCGCGGAGCCGGGCAAC 

GCTGGGGACTGCGCCTTTTGTCCCCGGAGGTCCCTGGAAGTTTGCGGCAGGACGCGCGCGGGG 

AGGCGGCGGAGGCAGCCCCGACGTCGCGGAGAACAGGGCGCAGAGCCGGCATGGGCATCGGG 

CGCAGCGAGGGGGGCCGCCGCGGGGCCCTGGGCGTGCTGCTGGCGCTGGGCGCGGCGCTTCTG 

GCCGTGGGCTCGGCCAGCGAGTACGACTACGTGAGCTTCCAGTCGGACATCGGCCCGTACCAG 

AGCGGGCGCTTCTACACCAAGCCACCTCAGTGCGTGGACATCCCCGCGGACCTGCGGCTGTGCC 

ACAACGTGGGCTACAAGAAGATGGTGCTGCCCAACCTGCTGGAGCACGAGACCATGGCGGAGG 

TGAAGCAGCAGGCCAGCAGCTGGGTGCCCCTGCTCAACAAGAACTGCCACGCCGGGACCCAGG 

TCTTCCTCTGCTCGCTCTTCGCGCCCGTCTGCCTGGACCGGCCCATCTACCCGTGTCGCTGGCTC 

TGCGAGGCCGTGCGCGACTCGTGCGAGCCGGTCATGCAGTTCTTCGGCTTCTACTGGCCCGAGA 

TGCTTAAGTGTGACAAGTTCCCGGAGGGGGACGTCTGCATCGCCATGACGCCGCCCAATGCCAC 

CGAAGCCTCCAAGCCCCAAGGCACAACGGTGTGTCCTCCCTGTGACAACGAGTTGAAATCTGA 

GGCCATCATTGAACATCTCTGTGCCAGCGAGTTTGCACTGAGGATGAAAATAAAAGAAGTGAA 

AAAAGAAAATGGCGACAAGAAGATTGTCCCCAAGAAGAAGAAGCCCCTGAAGTTGGGGCCCA 

TCAAGAAGAAGGACCTGAAGAAGCTTGTGCTGTACCTGAAGAATGGGGCTGACTGTCCCTGCC 

ACCAGCTGGACAACCTCAGCCACCACTTCCTCATCATGGGCCGCAAGGTGAAGAGCCAGTACTT 

GCrGACGGCCATCCACAAGTGGGACAAGAAAAACAAGGAGTTCAAAAACTTCATGAAGAAAA 

TGAAAAACCATGAGTGCCCCACCTTTCAGTCCGTGTTTAAGTGATTCTCCCGGGGGCAGGGTGG 

GGAGGGAGCCTCGGGTGGGGTGGGAGCGGGGGGGACAGTGCCCGGGAACCCGTGGTCACACA 

CACGCACTGCCCTGTCAGTAGTGGACATTGTAATCCAGTCGGCTTGTTCTTGCAGCATTCCC 
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CCCTTTCCCTCCATAGCCACGCTCCAAACCCCAGGGTAGCCATGGCCGGGTAAAGCAAGGGCC 

ATTTAGATTAGGAAGGTTTTTAAGATCCGCAATGTGGAGCAGCAGCCACTGCACAGGAGGAGG 

TGACAAACCATTTCCAACAGCAACACAGCCACTAAAACACAAAAAGGGGGATTGGGCGGAAA 

GTGAGAGCCAGCAGCAAAAACTACATITrGCAACTTGTTGGTGTGGATCTATTGGCTGATCTAT 

GCCITTCAACTAGAAAATTCTAATGATTGGCAAGTCACGTTGTTTrCAGGTCCAGAGTAGTT^ 

TTCTGTCTGCTTTAAATGGAAACAGACTCATACCACACTTACAATTAAGGTCAAGCCCAGAAAG 

TGATAAGTGCAGGGAGGAAAAGTGCAAGTCCATTATCTAATAGTGACAGCAAAGGGACCAGGG 

GAGAGGCATTGCCTTCTCTGCCCACAGTCmCCGTGTGATTGTCTTTGAATCTGAATCAGCCAG 

TCTCAGATGCCCCAAAGTTTCGGTTCCTATGAGCCCGGGGCATGATCTGATCCCCAAGACATGT 

GGAGGGGCAGCCTGTGCCTGCCrrTGTGTCAGAAAAAGGAAACCACAGTGAGCCTGAGAGAGA 

CGGCGATTTTCGGGCTGAGAAGGCAGTAGTTTTCAAAACACATAGTTA 



Figure 41 

GAATTCGTTCAGCCTGGTTAAGTCCAAGCTGGCTCATTCTGCTCCCCCGGGTCGGAGCCCCCCG 

GAGCTGCGCGCGGGCTTGCAGCGCCTCGCCCGCGCTGTCCTCCCGGTGTCCCGCTTCTCCGCGC 

CCCAGCCGCCGGCTGCCAGCTTTTCGGGGCCCCGAGTCGCACCCAGCGAAGAGAGCGGGCCCG 

GGACAAGCTCGAACTCCGGCCGCCTCGCCCTTAACCAGCTCCGTCCCTCTACCCCCTAGGGGTC 

GCGCCCACGATGCTGCAGGGCCCTGGCTCGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCT 

GGGCTCGGCGCGCGGGCTCTTCCTCTTTGGCCAGCCCGACTTCTCCTACAAGCGCAGCAATTGC 

AAGCCCATCCCGGCCAACCTGCAGCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCC 

AACCTGCTGGGCCACGAGACCATGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTG 

GTCATGAAGCAGTGCCACCCGGACACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCC 

TCGATGACCTAGACGAGACCATCCAGCCATGCCACTCTCGNTGCGTGCAGGTGAAGGATCGCT 

GCGCCCCGGTCATGTCCGCCTTCCCCTGGCCCGACATGCTTGAGTGCGACCGTTTCCCCCAGGA 

CAACGACCTTTGCATCCCCCTCGCTAGCAGCGACCACCTCCTGCCAGCCACCGAGGAAGCTCCA 

AAGGTATGTGAAGCCTGCAAAAATAAAAATGATGATGACAACGACATAATGGAAACGCTTTGT 

AAAAATGATTTTGCACTGAAAATAAAAGTGAAGGAGATAACCTACATCAACCGT 



Figure 42 

CCGGGTCGGAGCCCCCCGGAGCTGCGCGCGGGCTTGCAGCGCCTCGCCCGCGCTGTCCTCCCGGTGTCCC 
GCTTCTCCGCGCCCCAGCCGCCGGCTGCCAGCTTTTCGGGGCCCCGAGTCGCACCCAGCGAAGAGAGCGG 
GCCCGGGACAAGCTCGAACTCCGGCCGCCTCGCCCTTCCCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTC 
GCGCGCCCACGATGCTGCAGGGCCCTGGCTCGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCTGGG 
CTCGGCGCGCGGGCTCTTCCTCTtTGGCCAGCCCGACTTCTCCTACAAGCGCAGCAATTGCAAGCCCATC 
CCTGCCAACCTGCAGCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCCAACCTGCTGGGCCACG 
AGACCATGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCAGTGCCACCCGGA 
CACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCCTCGATGACCTAGACGAGACCATCCAGCCA 
TGCCACTCGCTCTGCGTGCAGGTGAAGGACCGCTGCGCCCCGGTCATGTCCGCCTTCGGCTTCCCCTGGC 
CCGACATGCTTGAGTGCGACCGTTTCCCCCAGGACAACGACCTTTGCATCCCCCTCGCTAGCAGCGACCA 
CCTCCTGCCAGCCACCGAGGAAGCTCCAAAGGTATGTGAAGCCTGCAAAAATAAAAATGATGATGACAAC 
GACATAATGGAAACGCTTTGTAAAAATGATTTTGCACTGAAAATAAAAGTGAAGGAGATAACCTACATCA 
ACCGAGATACCAAAATCATCCTGGAGACCAAGAGCAAGACCATTTACAAGCTGAACGGTGTGTCCGAAAG 
GGACCTGAAGAAATCGGTGCTGTGGCTCAAAGACAGCTTGCAGTGCACCTGTGAGGAGATGAACGACATC 
AACGCGCCCTATCTGGTCATGGGACAGAAACAGGGTGGGGAGCTGGTGATCACCTCGGTGAAGCGGTGGC 
AGAAGGGGCAGAGAGAGTTCAAGCGCATCTCCCGCAGCATCCGCAAGCTGCAGTGCTAGTCCCGGCATCC 
TGATGGCTCCGACAGGCCTGCTCCAGAGCACGGCTGACCATTTCTGCTCCGGGATCTCAGCTCCCGTTCC 
CCAAGCACACTCCTAGCTGCTCCAGTCTCAGCCTGGGCAGCTTCCCCCTGCCTTTTGCACGTTTGCATCC 
CCAGCATTTCCTGAGTTATAAGGCCACAGGAGTGGATAGCTGTTTTCACCTAAAGGAAAAGCCCACCCGA 
ATCTTGTAGAAATATTCAAACTAATAAAATCATGAATATTTTTATGAAGTTT 
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ACGGGGCCTGGGCGGSAGGGGCGGTGGCTGGAGCTCGGTAAAGCTCGTGGGACCCCATTGGGG 

GAATTTGATCCAAGGAAGCGGTGATTGCCGGGGGAGGAGAAGCTCCCAGATCCTTGTGTCCAC 

TTGCAGCGGGGGAGGCGGAGACGCGGAGCGGGCCTTTTGGCGTCCACTGCGCGGCTGCACCCT 

GCCCCATCCTGCCGGGATCATGGTCTGCGGCAGCCCGGGAGGGATGCTGCTGCTGCGGGCCGG 

GCTGCTTGCCCTGGCTGCTCTCTGCCTGCTCCGGGTGCCCGGGGCTCGGGCTGCAGCCTGTGAG 

CCCGTCCGCATCCCCCTGTGCAAGTCCCTGCCCTGGAACATGACTAAGATGCCCAACCACCTGC 

ACCACAGCACTCAGGCCAACGCCATCCTGGCCATCGAGCAGTTCGAAGGTCTGCTGGGCACCC 

ACTGCAGCCCCGATCTGCTCTTCTTCCTCTGTGCCATGTACGCGCCCATCTGCACCATTGACTTC 

CAGCACGAGCCCATCAACCCCTGTAAGTCTGTGTGCGAGCGGGCCCGGCAGGGCTGTGAGCCC 

ATACTCATCAAGTACCGCCACTCGTGGCCGGAGAACCTGGCCTGCGAGGAGCTGCCAGTGTAC 

GACAGGGGCGTGTGCATCTCTCCCGAGGCCATCGTTACTGCGGACGGAGCTGATTTTCCTATGG 

ATTCTAGTAACGGAAACTGTAGAGGGGCAAGCAGTGAACGCTGTAAATGTAAGCCTATTAGAG 

CTACACAGAAGACCTATTTCCGGAACAATTACAACTATGTCATTCGGGCTAAAGTTAAAGAGAT 

AAAGACTAAGTGCCATGATGTGACTGCAGTAGTGGAGGTGAAGGAGATTCTAAAGTCCTCTCT 

GGTAAACATTCCACGGGACACTGTCAACCTCTATACCAGCTCTGGCTGCCTCTGCCCTCCACTT 

AATGTTAATGAGGAATATATCATCATGGGCTATGAAGATGAGGAACGTTCCAGATTACTCTTGG 

TGGAAGGCTCTATAGCTGAGAAGTGGAAGGATCGACTCGGTAAAAAAGTTAAGCGCTGGGATA 

TGAAGCTTCGTCATCTTGGACTCAGTAAAAGTGATTCTAGCAATAGTGATTCCACTCAGAGTCA 

GAAGTCTGGCAGGAACTCGAACCCCCGGCAAGCACGCAACTAAATCCCGAAATACAAAAAGTA 

ACACAGTGGACTTCCTATTAAGACTTACITGCATTGCTGGACTAGCAAAGGAAAATTGCACTAT 

TGCACATCATATTCTATTGTTTACTATAAAAATCATGTGATAACTGATTATTACTTCTGTTTCTCT 

TTTGGTTTCTGCTTCTCTCITCrCTCAACCCCm 

TTGTGAGTTTTCTATTTCACTAATCATGAGAAAAACTGTTCTTTTGCAATAATAATAA^ 
ATGCTGTTA 

Figure 44 

CAGCGGCCGCTGAATTCTAGGGCGGGTTCGCGCCCCGAAGGCTGAGAGCTGGCGCTGCTCGTG 

CCCTGTGTGCCAGACGGCGGAGCTCCGCGGCCGGACCCCGCGGCCCCGCTTTGCTGCCGACTGG 

AGTTTGGGGGAAGAAACTCTCCTGCGCCCCAGAAGATTTCTTCCTCGGCGAAGGGACAGCGAA 

AGATGAGGGTGGCAGGAAGAGAAGGCGCTTTCTGTCTGCCGGGGTCGCAGCGCGAGAGGGCA 

GTGCCATGTTCCTCTCCATCCTAGTGGCGCTGTGCCTGTGGCTGCACCTGGCGCTGGGCGTGCG 

CGGCGCGCCCTGCGAGGCGGTGCGCATCCCTATGTGCCGGCACATGCCCTGGAACATCACGCG 

GATGCCCAACCACCTGCACCACAGCACGCAGGAGAACGCCATCCTGGCCATCGAGCAGTACGA 

GGAGCTGGTGGACGTGAACTGCAGCGCCGTGCTGCGCTTCTTCTTCTGTGCCATGTACGCGCCC 

ATTTGCACCCTGGAGTTCCTGCACGACCCTATCAAGCCGTGCAAGTCGGTGTGCCAACGCGCGC 

GCGACGACTGCGAGCCCCTCATGAAGATGTACAACCACAGCTGGCCCGAAAGCCTGGCCTGCG 

ACGAGCTGCCTGTCTATGACCGTGGCGTGTGCATTTCGCCTGAAGCCATCGTCACGGACCTCCC 

GGAGGATGTTAAGTGGATAGACATCACACCAGACATGATGGTACAGGAAAGGCCTCTTGATGT 

TGACTGTAAACGCCTAAGCCCCGATCGGTGCAAGTGTAAAAAGGTGAAGCCAACTTTGGCAAC 

GTATCTCAGCAAAAACTACAGCTATGTTATTCATGCCAAAATAAAAGCTGTGCAGAGGAGTGG 

CTGCAATGAGGTCACAACGGTGGTGGATGTAAAAGAGATCTTCAAGTCCTCATCACCCATCCCT 

CGAACTCAAGTCCCGCTCATTACAAATTCTTCTTGCCAGTGTCCACACATCCTGCCCCATCAAG 

ATGTTCTCATCATGTGTTACGAGTGGCGTTCAAGGATGATGCTTCTTGAAAATTGCTTAGTTGAA 

AAATGGAGAGATCAGCTTAGTAAAAGATCCATACAGTGGGAAGAGAGGCTGCAGGAACAGCG 

GAGAACAGTTCAGGACAAGAAGAAAACAGCCGGGCGCACCAGTCGTAGTAATCCCCCCAAACC 

AAAGGGAAAGCCTCCTGCTCCCAAACCAGCCAGTCCCAAGAAGAACATTAAAACTAGGAGTGC 

CCAGAAGAGAACAAACCCGAAAAGAGTGTGAGCTAACTAGTTTCCAAAGCGGAGACTTCCGAC 

TTCCTTACAGGATGAGGCTGGGCATTGCCTGGGACAGCCTATGTAAGGCCATGTGCCCCTTGCC 

CTAACAACTCACTGCAGTGCTCTTCATAGACACATCTTGCAGCATTITrCTrAAGGCTATGCTTC 

AGTTTTTCTTTGTAAGCCATCACAAGCCATAGTGGTAGGTTTGCCCTTTGGTACAGAAGGTGAG 

TTAAAGCTGGTGGAAAAGGCTTATTGCATTGCATTCAGAGTAACCTGTGTGCATACTCTAGAAG 

AGTAGGGAAAATAATGCTTGTTACAATTCGACCTAATATGTGCATTGTAAAATAAATGCCATAT 

TTCAAACAAAACACGTAATTTTTTTACAGTATGTTTTATTACCTm 

GTrAGTCATGTTTTAAAATGTGATGAAAATATAATGTTTTTAAGAA 

ATGTTAAAAGATCTTTATGTGTTTATGGTCTGCAGAAGGATTTTTGTGATGAAAGGGGAl 1 1 1 1 T 



WO 03/012082 



PCT/GB02/03409 



37/41 

GAAAAATTAGAGAAGTAGCATATGGAAAATTATAATGTGTT^^ 

TGTTTITAGCTAGAAAOTAAAAACAAAAATAATAATAAAGAAAAATAAATAAAAAGGAGAGG 

CAGACAATGTCTGGATTCCTGTITmGGTTACCTGATTrCCATGATCATGATGCTTCTTGTCAA 

CACCCTCTTAAGCAGCACCAGAAACAGTGAGTTTGTCTGTACCATTAGGAGTTAGGTACTAATT 

AGTTGGCTAATGCTCAAGTATITrATACCCACAAGAGAGGTATGTCACrCATCTrACTTCCCAG 

GACATCCACCCTGAGAATAATTTGACAAGCTTAAAAATGGCCTTCATGTGAGTGCCAAATTTTG 

TTTTTCTrCATTTAAATATTTT^ 

GAGAGGAAAGTTGAGTTCCACCTCTGAAATGAGAATTACTTGACAGTTGGGATACTTTAATCAG 

AAAAAAAGAACTTATTTGCAGCATTTTATCAACAAATITCATAATTGTGGACAATTGGAGGCAT 

TTATTTTAAAAAACAATTTTATTGGCCTTTTGCTAACACAGTAAGCATGTA 

CAATAAATGCACAACGCCCAAAGGAAATAAAATCCTATCTAATCCTACTCTCCACTACACAGA 

GGTAATCACTATTAGTATTTrGGCATATTATTCTCCAGGTGTTTGCTTATGCACTTATAAAATGA 

TITGAACAAATAAAACTAGGAACCTGTATACATGTGTTTCATAACCTGCCTCCTTTGCTTGGCCC 

TTTATTGAGATAAGTTTTCCTGTCAAGAAAGCAGAAACCATCTCATTTCTAACAGCTGTGTTATA 

TrCCATAGTATGCATTACTCAACAAACTGTTGTGCTATTGGATACTTAGGTGGTTTCTTCACTGA 

CAATACTGAATAAACATCTCACCGGAATTC 

Figure 45 

AAGCTTGATATCGAATTCGCGGCCGCGTCGACGGGAGGCGCCAGGATCAGTCGGGGCACCCGC 

AGCGCAGGCTGCCACCCACCTGGGCGACCTCCGCGGCGGCGGCGGCGGCGGCTGGGTAGAGTC 

AGGGCCGGGGGCGCACGCCGGAACACCTGGGCCGCCGGGCACCGAGCGTCGGGGGGCTGCGC 

GGCGCGACCCTGGAGAGGGCGCAGCCGATGCGGGCGGCGGCGGCGGCGGGGGGCGTGCGGAC 

GGCCGCGCTGGCGCTGCTGCTGGGGGCGCTGCACTGGGCGCCGGCGCGCTGCGAGGAGTACGA 

CTACTATGGCTGGCAGGCCGAGCCGCTGCACGGCCGCTCCTACTCCAAGCCGCCGCAGTGCCTT 

GACATCCCTGCCGACCTGCCGCTCTGCCACACGGTGGGCTACAAGCGCATGCGGCTGCCCAA.ee 

TGCTGGAGCACGAGAGCCTGGCCGAAGTGAAGCAGCAGGCGAGCAGCTGGCTGCCGCTGCTGG 

CCAAGCGCTGCCACTCGGATACGCAGGTCTTCCTGTGCTCGCTCTTTGCGCCCGTCTGTCTCGAC 

CGGCCCATCTACCCGTGCCGCTCGCTGTGCGAGGCCGTGCGCGCCGGCTGCGCGCCGCTCATGG 

AGGCCTACGGCITCCCCTGGCCrGAGATGCTGCACTGCCACAAGTTCCCCCTGGACAACGACCT 

CTGCATCGCCGTGCAGTTCGGGCACCTGCCCGCCACCGCGCCTCCAGTGACCAAGATCTGCGCC 

CAGTGTGAGATGGAGCACAGTGCTGACGGCCTCATGGAGCAGATGTGCTCCAGTGACTTTGTG 

GTCAAAATGCGCATCAAGGAGATCAAGATAGAGAATGGGGACCGGAAGCTGATTGGAGCCCA 

GAAAAAGAAGAAGCTGCTCAAGCCGGGCCCCCTGAAGCGCAAGGACACCAAGCGGCTGGTGC 

TGCACATGAAGAATGGCGCGGGCTGCCCCTGCCCACAGCTGGACAGCCTGGCGGGCAGCTTCC 

TGGTCATGGGCCGCAAAGTGGATGGACAGCTGCTGCTCATGGCCGTCTACCGCTGGGACAAGA 

AGAATAAGGAGATGAAGTTTGCAGTCAAATTCATGTTCTCCTACCCCTGCTCCCTCTACTACCCT 

TTCTTCTACGGGGCGGCAGAGCCCCACTGAAGGGCACTCCTCCTTGCCCTGCCAGCTGTGCCTT 

GCTTGCCCTCTGGCCCCGCCCCAACnTCCAGGCTGACCCGGCCCTACTGGAGGGTGTTTTCACG 

AATGTTGTTACTGGCACAAGGCCrAAGGGATGGGCACGGAGCCCAGGCTGTCCTTTTTGACCCA 

GGGGTCCTGGGGTCCCTGGGATGTTGGGCTTCCTCTCTCAGGAGCAGGGOTCTTCATCTGGGT 

GAAGACCTCAGGGTCTCAGAAAGTAGGCAGGGGAGGAGAGGGTAAGGGAAAGGTGGAGGGGC 

TCAGGGCACCCTGAGGCGGAGGTTTCAGAGTAGAAGGTGATGTCAGCTCCAGCTCCCCTCTGTC 

GGTGGTGGGGCCrCACCTTGAAGAGGGAAGTCTCAATATTAGGCTAAGCTATTTGGGAAAGTTC 

TCCCCACCGCCCCTGTACGCGTCATCCTAGCCCCCCTTAGGAAAGGAGTTAGGGTCTCAGTGCC 

TCCAGCCACACCCCCrGCCTTCCCCAGCTTGCCCATTTCCCTGCCCCAAGGCCCAGAGCTCCCCC 

CAGACTGGAGAGCAAGCCCAGCCCAGCCTCGGCATAGACCCCCTTCTGGTCCGCCCGTGGCTCG 

ATTCCCGGGATTCATTCCTCAGCCTCTGCTTCnCCCT 

TGAGGCCATAGGTACTAGACAACCAATACATGCAGGGTTGGGTTTTCTAATTTTTT^ 
AATTAAATCAAAGGTCGACGCGCGGCCGCGGAATTCCTGCAGCCCGGGGGATCCCCGGGTACC 

GAGCTCGAATTC 



WO 03/012082 



PCT/GB02/03409 



Figure 46 38 ' 4 1 

ATGCATCTCCTCTTATTTCAGCTGCTGGTACTCCTGCCTCTAGGAAAGACCACACGGCACCAGG 

ATGGCCGCCAGAATCAGAGTTCTCTTTCCCCCGTACTCCTGCCAAGGAATCAAAGAGAGCTTCC 

CACAGGCAACCATGAGGAAGCTGAGGAGAAGCCAGATCTGTTTGTCGCAGTGCCACACCTTGT 

AGCCACCAGCCCTGCAGGGGAAGGCCAGAGGCAGAGAGAGAAGATGCTGTCCAGATTTGGCA 

GGTTCTGGAAGAAGCCTGAGAGAGAAATGCATCCATCCAGGGACTCAGATAGTGAGCCCrrCC 

CACCTGGGACCCAGTCCCTCATCCAGCCGATAGATGGAATGAAAATGGAGAAATCTCCTCTTCG 

GGAAGAAGCCAAGAAATTCTGGCACCACTTCATGTTCAGAAAAACTCCGGCTTCTCAGGGGGT 

CATCTTGCCCATCAAAAGCCATGAAGTACATTGGGAGACCTGCAGGACAGTGCCCTTCAGCCA 

GACTATAACCCACGAAGGCTGTGAAAAAGTAGTTGTTCAGAACAACCTTTGCTTTGGGAAATGC 

GGGTCTGTTCATTTTCCTGGAGCCGCGCAGCACTCCCATACCTCCTGCTCTCACTGTTTGCCTGC 

CAAGTTCACCACGATGCACTrGCCACTGAACTGCACTGAACTTTCCTCCGTGATCAAGGTGGTG 

ATGCTGGTGGAGGAGTGCCAGTGCAAGGTGAAGACGGAGCATGAAGATGGACACATCCTACAT 

GCTGGCTCCCAGGATTCCTTTATCCCAGGAGTTTCAGCTTGA 



Figure 47 

CGGCACGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCT 

TGAGTCCITCTGAGATGATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGGT 

AGCGGCGGCTCrCGGCGGCCACCCTCrGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTCAAT 

TCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTCTGCA 

GTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACAACTAC 

CAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGTCCCACC 

CGCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACGCTGCATG 

CGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTCTTCTGATCAAA 

ATCATTTCCGAGGAGAAATrGAGGAAACCATCACTGAAAGCTTTGGTAATGATCATAGCACCTT 

GGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAAAGGACAAGAAGG 

TrCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGTCCA 

AGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCTCTC 

ATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGAAAGA 

TCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGAGACACTAAACCAGCTATCCA 

AATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACCTTTTATGACCTTCATCAACTCAA 

TCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCATTCCAATAACACCTrCCAAAAACCTG 

GAGTGTAAGAGCTTTGTTTCTITATGGAACTCCCCTGTGATTGCAGT AAAT TACTGTATTGTAAA 

TTCTCAGTGTGGCACTTACCTGTAAATGCAATGAAACTITrAATTATTTTTCTA 

CTGCCTATTTTTCCTCITGTTATGTAAATTTTTGTACACATT 

TTCTATATTGAACTGAAGTAAATCATTTCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCA 
TTTAATTCTAGAGTCTAGAACGCAAGGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGT 
AACATGAAAATACTAGCTTATTTTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCm 
AGGCTGTGATAGTTTTTGAAATAAAATTTAACATTTAATATCATGAAATGTTATAAGTAGACAT 



Figure 48 

GCGGGTCTCGCTTGGGTTCCGCTAATTTCTGTCCTGAGGCGTGAGACTGAGTTCATAGGGTCCT 

GGGTCCCCGAACCAGGAAGGGTTGAGGGAACACAATCTGCAAGCCCCCGCGACCCAAGTGAGG 

GGCCCCGTGTTGGGGTCCTCCCTCCCTTTGCATTCCCACCCCTCCGGGCTTTGCGTCTTCCTGGG 

GACCCCCTCGCCGGGAGATGGCCGCGTTGATGCGGAGCAAGGATTCGTCCTGCTGCCTGCTCCT 

ACTGGCCGCGGTGCTGATGGTGGAGAGCTCACAGATCGGCAGTTCGCGGGCCAAACTCAACTC 

CATCAAGTCCTCTCTGGGCGGGGAGACGCCTGGTCAGGCCGCCAATCGATCTGCGGGCATGTAC 

CAAGGACTGGCATTCGGCGGCAGTAAGAAGGGCAAAAACCTGGGGCAGGCCTACCCTTGTAGC 

AGTGATAAGGAGTGTGAAGTTGGGAGGTATTGCCACAGTCCCCACCAAGGATCATCGGCCTGC 

ATGGTGTGTCGGAGAAAAAAGAAGCGCTGCCACCGAGATGGCATGTGCTGCCCCAGTACCCGC 

TGCAATAATGGCATCTGTATCCCAGTTACTGAAAGCATCTTAACCCCTCACATCCCGGCTCTGG 

ATGGTACTCGGCACAGAGATCGAAACCACGGTCATTACTCAAACCATGACTTGGGATGGCAGA 

ATCTAGGAAGACCACACACTAAGATGTCACATATAAAAGGGCATGAAGGAGACCCCTGCCTAC 

GATCATCAGACTGCATTGAAGGGTTTTGCTGTGCTCGTCATTTCTGGACCAAAATCTGCAAACC 
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AGTGCTCCATCAGGGGGAAGTCTGTACCAAACAACGCAAGAAGGGTTCTCATGGGCTGGAAAT 

TTTCCAGCGTTGCGACTGTGCGAAGGGCCTGTCTTGCAAAGTATGGAAAGATGCCACCTACTCC 

TCCAAAGCCAGACTCCATGTGTGTCAGAAAATTTGATCACCATTGAGGAACATCATCAATTGCA 

GACTGTGAAGTTGTGTATTTAATGCATTATAGCATGGTGGAAAATAAGGTTCAGATGCAGAAG 

AATGGCTAAAATAAGAAACGTGATAAGAATATAGATGATCAC 



Figure 49 

CTATCACAATGAGACCAACACAGACACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGA 

AATTCACAAGATAACCAACAACCAGACTGGACAAATGGTCTTTTCAGAGACAGTTATCACATCT 

GTGGGAGACGAAGAAGGCAGAAGGAGCCACGAGTGCATCATCGACGAGGACTGTGGGCCCAG 

CATGTACTGCCAGTTTGCCAGCTTCCAGTACACCTGCCAGCCATGCCGGGGCCAGAGGATGCTC 

TGCACCCGGGACAGTGAGTGCTGTGGAGACCAGCTGTGTGTCTGGGGTCACTGCACCAAAATG 

GCCACCAGGGGCAGCAATGGGACCATCTGTGACAACCAGAGGGACTGCCAGCCGGGGCTGTGC 

TGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGCACACCCCTGCCCGTGGAGGGCGAGCTTT 

GCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCACCTGGGAGCTAGAGCCTGATGGAGCCTT 

GGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGCCCCACAGCCACAGCCTGGTGTATGTG 

TGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGGGAGATCCTGCTGCCCAGAGAGGTC 

CCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCGCCAGGAGCTGGAGGACCTGGAG 

AGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTGCCGCCGCTGCACTGCTGGGA 

GGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAATAGAAATAGCTAATTTAT 

TTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTACATCTTCTTCCCAGTAAG 

TTTCCCCTCTGGCTrGACAGCATGAGGTGTTGTGCATTTGTTCAGCTCCCCCAGGCTGTTCTCCA 

GGCTTCACAGTCTGGTGCITGGGAGAGTCAGGCAGGGTTAAACTGCAGGAGCAGTTTGCCACC 

CCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCGTTTGTTCT ACAT GGCT 

TTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTCTGATTGGTTTTGGGG 

AAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATGCAACAAATGAATT 

TTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTrCAGCTGTTGCAGATGAAATGTTC 

TGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCCTACCTCTGTGCCA 

GGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGGAGGGGGTCATTG 

TTCTCCTCGTCCATCAGGGATCTCAGAGGNCTCAGAGACTGCAAGCTGCTTGCCCAAGTCACAC 

AGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCTCTCCACTAC 

CCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGATTTTTCTTT 

TGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAACTACT 

GTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGATATTG 

ACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCATAC 

AGGTTAACCTGCAGAAACAGTACITAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGCA 

AAATCACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGG 

CTGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGA 

TGTTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTG 

CACATGATTGTATAAGCATGCTTTCTTTC^ 

AAATCAAGCATAAATCAC 



Figure 50 

AGACGACGTGCTGAGCTGCCAGCTTAGTGGAAGCTCTGCTCTGGGTGGAGAGCAGCCTCGCTTT 

GGTGACGCACAGTGCTGGGACCCTCCAGGAGCCCCGGGATTGAAGGATGGTGGCGGCCGTCCT 

GCTGGGGCTGAGCTGGCTCTGCTCTCCCCTGGGAGCTCTGGTCCTGGACTTCAACAACATCAGG 

AGCTCTGCTGACCTGCATGGGGCCCGGAAGGGCTCACAGTGCCTGTCTGACACGGACTGCAAT 

ACCAGAAAGTTCTGCCTCCAGCCCCGCGATGAGAAGCCGTTCTGTGCTACATGTCGTGGGTTGC 

GGAGGAGGTGCCAGCGAGATGCCATGTGCTGCCCTGGGACACTCTGTGTGAACGATGTTTGTAC 

TACGATGGAAGATGCAACCCCAATATTAGAAAGGCAGCTTGATGAGCAAGATGGCACACATGC 

AGAAGGAACAACTGGGCACCCAGTCCAGGAAAACCAACCCAAAAGGAAGCCAAGTATTAAGA 

AATCACAAGGCAGGAAGGGACAAGAGGGAGAAAGTTGTCTGAGAACTTTTGACTGTGGCCCTG 
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GACTTTGCTGTGCTCGTCATTTTTGGACGAAAA^ 

CTGCTCCAGAAGAGGGCATAAAGACACTGCTCAAGCTCCAGAAATCTTCCAGCGTTGCGACTGT 
GGCCCTGGACTACTGTGTCGAAGCCAATTGACCAGCAATCGGCAGCATGCTCGATTAAGAGTAT 
GCCAAAAAATAGAAAAGCTATAAATATTTCAAAATAAAGAAGAATCCACATTGC 



Figure 51 

AGGCAGAATACTTCTATGAATTCCTGTCCTTGCGCTCCCTGGATAAAGGCATCATGGCAGATCC 

AACCGTCAATGTCCCTCTGCTGGGAACAGTGCCTCACAAGGCATCAGTTGTTCAAGTTGGTTTC 

CCATGTCTTGGAAAACAGGATGGGGTGGCAGCATTTGAAGTGGATGTGATTGTTATGAATTCTG 

AAGGCAACACCATTCTCCAAACACCTCAAAATGCTATCTTCITrAAAACATGTCAACAAGCTGA 

GTGCCCAGGCGGGTGCCGAAATGGAGGCTTTTGTAATGAAAGACGCATCTGCGAGTGTCCTGA 

TGGGTTCCACGGACCTCACTGTGAGAAAGCCCTTTGTACCCCACGATGTATGAATGGTGGACTT 

TGTGTGACTCCTGGTTTCTGCATCTGCCCACCTGGATTCTATGGAGTGAACTGTGACAAAGCAA 

ACTGCTCAACCACCTGCTITAATGGAGGGACCTGTTTCTACCCTGGAAAATGTATTTGCCCTCCA 

GGACTAGAGGGAGAGCAGTGTGAAATCAGCAAATGCCCACAACCCTGTCGAAATGGAGGTAA 

ATGCATrGGTAAAAGCAAATGTAAGTGTTCCAAAGGTTACCAGGGAGACCTCTGTTCAAAGCCT 

GTCTGCGAGCCTGGCTGTGGTGCACATGGAACCTGCCATGAACCCAACAAATGCCAATGTCAA 

GAAGGTTGGCATGGAAGACACTGCAATAAAAGGTACGAAGCCAGCCTCATACATGCCCTGAGC 

GCAGCAGCGCCCAGCTCAGGCAGCACACGCCTTCACTTAAAAAGGCCGAGGAGCGGCGGCATC 

CACCTGAATCCAATTACATCTGGTGAACTCCGACATCTGAAACGTTTTAAGTTACACCAAGTTC 

ATAGCCTTTGTTAACCTTTCATGTGTTGAATGTTCAAATAATGTTCATTACACTrAAGAATACTG 

GCCTGAATTTTATTAGCTTCATTATAAATCACTGAG 

AAGTACGTCTGTAGCATGATGGTATAGATTTTCTTGTTTCAGTGCTTTGGGACAGATTT^ 

ATGTCAATTGATCAGGTTAAAATTITCAGTGTGTAGTTGGCAGATATTTTCAAAATTACAATGC 

ATTTATGGTGTCTGGGGGCAGGGGAACATCAGAAAGGTTAAATTGGGCAAAAATGCGTAAGTC 

ACAAGAATTTGGATGGTGCAGTrAATGTTGAAGTTACAGCATTTCAGATTTTATTGTCAGATAT 

TTAGATGTTTGTTACATTTTTAAAAATTGCTCTTAATTTTTAAACTCT 

CCTTACCATTATTCCAGAGATTCAGTATTAAAAAAAAAAAAATTACACTGTGGTAGTGGCATTT 
AAACAATATAATATATTCTAAACACAATGAAATAGGGAATATAATGTATGAACTTTTTGCATTG 
GCTTGAAGCAATATAATATATTGTAAACAAAACACAGCTCTTACCTAATAAACATTTTATACTG 
TTTGTATGTATAAAATAAAGGTGCTGCTTTAGTTTTC 



Figure 52 

ATGGGCATCGGGCGCAGCGAGGGGGGCCGCCGCGGGGCAGCCCTGGGCGTGCTGCTGGCGCTGGGCGCGG 
CGCTTCTGGCCGTGGGCTCGGCCAGCGAGTACGACTACGTGAGCTTCCAGTCGGACATCGGCCCGTACCA 
GAGCGGGCGCTTCTACACCAAGCCACCTCAGTGCGTGGACATCCCCGCGGACCTGCGGCTGTGCCACAAC 
GTGGGCTACAAGAAGATGGTGCTGCCCAACCTGCTGGAGCACGAGACCATGGCGGAGGTGAAGCAGCAGG 
CCAGCAGCTGGGTGCCCCTGCTCAACAAGAACTGCCACGCCGGCACCCAGGTCTTCCTCTGCTCGCTCTT 
CGCGCCCGTCTGCCTGGACCGGCCCATCTACCCGTGTCGCTGGCTCTGCGAGGCCGTGCGCGACTCGTGC 
GAGCCGGTCATGCAGTTCTTCGGCTTCTACTGGCCCGAGATGCTTAAGTGTGACAAGTTCCCCGAGGGGG 
ACGTCTGCATCGCCATGACGCCGCCCAATGCCACCGAAGCCTCCAAGCCCCAAGGCACAACGGTGTGTCC 
TCCCTGTGACAACGAGTTGAAATCTGAGGCCATCATTGAACATCTCTGTGCCAGCGAGTTTGGGCTGAGT 
TTAAAGATGATTGTGGGTAGCTCCCATAACTCATGCTGCACGCTGGGTCCTTCTCATCCCAACTCCTCAA 
AGCGGCAGGAGCAGGAACTGGGGACTCCTGAGAGAAGGCTTGGATATGGCCTTTTATTACACTTCATCCA 
AGGAAATCTGCCCCCACCCTGTGCCCAGGCCCGATCACGCATGAGGCTAAAGACGGAGGCCACTCCGCTG 
GCTCTGGGTAGATCTGCCCCTGGACTGTTTGCCGACTGCCCGGAGCGCCCTCTGCCGGTCTGCAGCTTCC 
CACACCACACGGAAGAAGTGGGGAAACTGAGGATACATTCTTTCCTCCTCCAGGTAAAGGGATTCTCAAT 
GAAGGGCTTGTGTGCACCTTCCACACTTAGATACCTCTACTACCTGAAAACCAGCATGCAGCATGTACAT 
CAAGAGTACCAGGCACATAGTGCTCAAGTCTGGGCTAATATGCCACCTGCAGAGAGATGTAAAGATGAAG 
AAGACAAAGCCATGTTTT CAAAGTGA 
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GGCGGGTTCGCGCCCCGAAGGCTGAGAGCTGGCGCTGCTCGTGCCCTGTGTGCCAGACGGCGGAGCTCCG 

CGGCCGGACCCCGCGGCCCCGCTTTGCTGCCGACTGGAGTTTGGGGGAAGAAACTCTCCTGCGCCCCAGA 

AGATTTCTTC CTCGGCGAAGGGACAGCGAAAGATGAGGGTGGCAGGAAGAGAAGGCG CTTTCTGT CTGC C 

GGGGTCGCAGCGCGAGAGGGCAGTGCCATGTTCCTCTCCATCCTAGTGGCGCTGTGCCTGTGGCTGCACC 

TGGCGCTGGGCGTGCGCGGCGCGCCCTGCGAGGCGGTGCGCATCCCTATGTGCCGGCACATGCCCTGGAA 

CATC^CGCGGATGCCCAACC^CCTG(^CCACAGCACGCAGGAGAACGCCATCCTGGCCATCGAGCAGTA^ 

GAGGAGCTGGTGGACGTGAACTGCAGCGCCGTGCTGCGCTTCTTCTTCTGTGCCATGTACGCGCCCATTT 

GCACCCTGGAGTTCCTGCACGACCCTATCAAGCCGTGCAAGTCGGTGTGCCAACGCGCGCGCGACGACTG 

CGAGCCCCTCATGAAGATGTACAACCACAGCTGGCCCGAAAGCCTGGCCTGCGACGAGCTGCCTGTCTAT 

GACCGTGGCGTGTGCATTTCGCCTGAAGCCATCGTCACGGACCTCCCGGAGGATGTTAAGTGGATAGACA 

TCACACCAGACATGATGGTACAGGAAAGGCCTCTTGATGTTGACTGTAAACGCCTAAGCCCCGATCGGTG 

CAAGTGTAAAAAGGTGAAG CCAACTTTGGGAACGTATCT CAGCAAAAACTACAGCTATGTTA 

AAAATAAAAGCTGTG CAGAGGAGTGGCTGCAATGAGGTCACAACGGTGGTGGATGT AAAAGAGAT CTT CA 

AGTCCTCATCACCCATCCCTCGAACTCAAGTCCCGCTCATTACAAATTCTTCTTGCCAGTGTCCACACAT 

CCTGCCCCATCAAGATGTTCTCATCATGTGTTACGAGTGGCGTTCAAGGATGATGCTTCTTGAAAATTGC 

TTAGTTGAAAAATGGAGAGATCAGCTTAGTAAAAGATCCATACAGTGGGAAGAGAGGCTGCAGGAACAGC 

GGAGAACAGTTCAGGACAAGAAGAAAACAGC CGGGCGCAC CAGT CGTAGTAATCCCC C CAAACCAAAGGG 

AAAGCCTCCTGCT CC CAAACCAGCC AGTC CCAAGAAGAACATTAAAACTAGGAGTGCCCAGAAGAGAACA 

AACCCGAAAAGAGTGTGAGCTAACTAGTTTC CAAAGCGGAGACTT C CGACTTC CTTACAGGATGAGGCTG 

GGCATTGCCTGGGACAGCCTATGTAAGGCCATGTGCCCCTTGCCCTAACAACTCACTGCAGTGCTCTTCA 

TAGACACATCTTGCAGCATTTTTCTTAAGGCTATGCTTCAGTTTTTCTTTGTAAGCCATCACAAGCCATA 

GTGGTAGGTTTGCCCTTTGGTACAGAAGGTGAGTTAAAGCTGGTGGAAAAGGCTTATTGCATTGCATTCA 

GAGTAACCTGTGTGCATACTCTAGAAGAGTAGGGAAAATAATGCTTGTTACAATTCGACCTAATATGTGC 

ATTGTAAAATAAATGCCATATTTCAAACAAAACACGTAATTTTTTTACAGTATGTTTTATTACCTTTTGA 

TATCTGTTGTTGCAATGTTAGTGATGTTTTAAAATGTGATGAAAATATAATGTTTTTAAGAAGGAACAGT 

AGTGGAATGAATGTTAAAAGATCTTTATGTGTTTATGGTCTGCAGAAGGATTTTTGTGATGAAAGGGGAT 

TTTTTGAAAAATTAGAGAAGTAGCATATGGAAAATTATAATGTGTTTTTTTACCAATGACTTCAGTTTCT 

GTTTTTAGCTAGAAACTTAAAAACAAAAATAATAATAAAGAAAAATA7VATAAAAAGGAGAGGCAGACAAT 

GTCTGGATTCCTGTTTTTTGGTTACCTGATTTCCATGATGATGATGCTTCTTGTCAACACCCTCTTAAGC 

AGCACCAGAAACAGTGAGTTTGTCTGTACCATTAGGAGTTAGGTACTAATTAGTTGGCTAATGCTCAAGT 

ATTTTATACCCACAAGAGAGGTATGTCACTCATCTTACTTCCCAGGACATCCACCCTGAGAATAATTTGA 

CAAGCTTAAAAATGGCCTTCATGTGAGTGCCAAATTTTGTTTTTCTTCATTTAAATATTTTCTTTGCCTA 

AATACATGTGAGAGGAGTTAAATATAAATGTACAGAGAGGAAAGTTGAGTTCCACCTCTGAAATGAGAAT 

TACTTGACAGTTGGGATACTTTAATCAGAAAAAAAGAACTTATTTGCAGCATTTTATCAACAAATTTCAT 

AATTGTGGACAATTGGAGGCATTTATTTTAAAAAACAATTTTATTGGCCTTTTGCTAACACAGTAAGCAT 

GTATTTTATAAGGGATTCAATAAATGCACAACGCCCAAAGGAAATAAAATCCTATCTAATCCTACTCTCC 

ACTACACAGAGGTAATCACTATTAGTATTTTGGCATATTATTCTCCAGGTGTTTGCTTATGCACTTATAA 

AATGATTTGAACAAATAAAACTAGGAACCTGTATACATGTGTTTCATAACCTGCCTCCTTTGCTTGGCCC 

TTTATTGAGATAAGTTTTCCTGTCAAGAAAGCAGAAAC CATCT CATTT CTAACAGCTGTGTTATATT CCA 

TAGTATGCATTACTCAACAAACTGTTGTGCTATTGGATACTTAGGTGGTTTCTTCACTGACAATACTGAA 

TAAACATCTCACCGGAATTC 

Figure 54 

GAGGCGCCTTGGGACCGCGTGGGAGCCGCAGCCGAACCGAGTAGGGACCGGGACCGCGCGGCGCCGCCG 

TCCCCGGCCGGGCCCGGCCCCCGCGAGCCGAGCGCGCGCCCCCGTCGCCCACCCGGGCGCGGCTGGATGC 

GGCGGGGTCCCCGCGGCGGCGACCCCCGGCCCCGAGCGCCCGGAGCGCCCAGAGGCGGCGTGCGGGGCC 

CGGGGACGCCGCGCCCTSTBGTGCGCCGAGGCGCGCCCCGAGACAGCCGGGGGCCCGCGCCGCAGCCGC 

CGCCCGCGCTGAGCCCCGGCCCGGCCCGCGGCCCGCGCCCGGCGGCAGCNTGAGCCAGGCCGAGCTGTC 

CACCTGCTCCGCGCCGCAGACGCAGCGCATCTrCCAGGAGGCTGTGCGCNAGGGCAACACGCAGGAGCT 

GCAGTYGCTGCTGCAGAACATGACCAACTGCGAGTT(^CGTGAACTCGTTCGGGCCCGAGGGCCAGAC 

GGCGCTGCACCAGTCGGTCATCGTCGGCAACCTGGTGCTCGTGAAGCTGCTGGTCAAGTTCGGCGCCGAC 

ATCCGCCTGGCCAACCGCGACGGCTGGAGCGCGCTGCAMATCGCCGCGTTCGGTGGCCACCAGGACATC 

GTGCTCTATCTCATCACCAAGGCGAAGTACGCGGCCAGCGCSGGTGTATGCCCGCCGGGACCCCGGACCC 

CGGCCCTGCGCCCGCGTCGTCTCTGCTGTACCTTCCCGCCAACTACCTCGGTGCGCGCMCGGCTCGCAGG 

CCCCGCCAGAAGGCCCGTGGCAACGGCGAATACGGCGCGTGCGTCMCGGCCCCAGGGTC 



